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METHOD OF MODULATING METABOLITE BIOSYNTHES ISJN RECOMBINANT 
CELLS 

FIELD OF THE INVENTION 

The present invention relates to the field of modulating the biosynthesis of 
metabolites in cells. More particularly, the invention relates to the cloning and expression 
of transcription factor genes, such as genes coding for AP2 domain class transcription 
factors and the use of such transcription factor for regulating the expression of genes 
involved in the biosynthesis of metabolites or precursors therefor. Especially the invention 
relates to the cloning and expression of AP2 domain class plant transcription factors and 
use hereof for regulating the expression of biosynthetic genes involved in the metabolite 
biosynthesis (i.e. of primary and/or secondary metabolites, and in particular of secondary 
metabolites) in plant cells. 



BACKGROUND OF THE INVENTION AND PRIOR ART 

Generally, two basic types of metabolites are synthesised in cells, i.e. those referred 
to as primary metabolites and those referred to as secondary metabolites. A primary 
metabolite is any intermediate in, or product of the primary metabolism in cells. The 
primary metabolism in cells is the sum of metabolic activities that are common to most, if 
not all. living cells and are necessary for basal growth and maintenance of the cells. 
Primary metabolism thus includes pathways for generally modifying and synthesising 
certain carbohydrates, proteins, fats and nucleic acids, with the compounds involved in the 
pathways being designated primary metabolites. 

In contrast hereto, secondary metabolites usually do not possess a basal function in 
cell growth and maintenance. They are a group of chemically very diverse products that 
often have a restricted taxonomic distribution. Secondary metabolites normally exist as 
members of closely related chemical families, usually of a molecular weight of less than 
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1,500, although some bacterial toxins are considerably larger Two examples of fungal cell 
secondary metabolites are penicillin and ergotamine. 

Plant metabolites include a diverse array of chemically unrelated compounds such 
as carbohydrates and lipids (e.g. mono-, oligo- and polysaccharides, sugar alcohols, 
5 organic acids, fatty acids and lipids, acetylenes and thiophenes), nitrogen-containing 

compounds (e.g. amino acids, amines, glycosides, glucosinolates, purines, pyrimidines and 
polypeptides) of which most, but not all, generally are referred to as primary metabolites. 
Accordingly, some compounds such as fatty acids, sugars and steroids may e.g. be 
categorised both as primary metabolites and secondary metabolites (see e.g. Dewick, P.M., 
10 1997, Medicinal Natural Products A Biosynthetic Approach, John Wiley & Sons, 
Chichester). 

Secondary plant metabolites include e.g. alkaloid compounds (e.g. terpenoid indole 
alkaloids and indole alkaloids), phenolic compounds (e.g., quinones, lignans and 
flavonoids), terpenoid compounds (e.g. monoterpenoids, iridoids, sesquiterpenoids, 

15 diterpenoids and triterpenoids). In addition, secondary metabolites include small molecules 
(i.e., having a molecular weight of less than 600), such as substituted heterocyclic 
compounds which may be monocyclic or polycyclic, fused or bridged. 

Many plant secondary metabolites have value as pharmaceuticals, food colours, 
flavours and fragrances. Plant pharmaceuticals include e.g.: taxol, digoxin, colchicine, 

20 codeine, morphine, quinine, shikonin, ajmalicine and vinblastine. Examples of secondary 
metabolites that are useful as food additives include anthocyanins, vanillin, and a wide 
variety of other fruit and vegetable flavours and texture modifying agents. 

Plant secondary metabolites such as terpenoid indole alkaloids (TIA) represent a 
class of pharmaceutically useful compounds which naturally occur in many plant species. 

25 Some plant secondary metabolites are linked to plant or plant cell defence 

mechanism and may e.g. confer to the plant antimicrobial activity, protection against UV 
light, herbivores, pathogens, insects and nematodes, and the ability to grow at low light 
intensity. 

There are numerous examples of the application of plant secondary metabolites 
30 such as TIA's in medicine. The monomeric alkaloids serpentine and ajmalicine found in 
Catharanthus roseus are e.g. used as tranquillisers and to reduce hypertension, 
respectively. The dimeric alkaloids vincristine and vinblastine, also found in C. roseus, are 
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potent antitumor drugs. Camptothecin, a monomeric TIA found in Camptotheca 
acuminata, also possesses anti-tumor activity. Quinine from Cinchona officinalis is used in 
malaria treatment. 

However, a major problem associated with the industrial use of the above 
5 metabolites is that only very small or variable amounts of these compounds are present in 
plants. The recovery of useful metabolites from their natural sources is thus in many 
instances difficult due to the enormous amounts of source material which may be required 
for the isolation of utilisable quantities of the desired products. As an example, over 500 
kg of Catharanthus roseus is needed to obtain Ig of vincristine. Extraction is both costly 

10 and tedious, requiring large quantities of raw material and extensive use of 

chromatographic fractionation procedures. The low levels of these compounds in plants 
may also imply that many of the compounds are not detected when performing normal 
screening procedures, hence many unknown compounds may exist. 

Biosynthesis of TIA compounds proceeds in most plants via many enzymatic steps. 

15 TIA compounds consist of an indole moiety provided by tryptamine and a terpenoid 

portion provided by the iridoid glucoside compound secologanin. Tryptamine is derived 
from primary metabolism by a single enzymatic conversion of the amino acid tryptophan, 
a reaction catalysed by the enzyme tryptophan decarboxylase (TDC) (EC 4.1.1.28). The 
biosynthesis of secologanin requires a number of enzymatic conversions of which the first 

20 step is the hydroxy lation of geraniol by the enzyme geraniol 10-hydroxyiase (GIOH). 
Tryptamine and secologanin are condensed by the enzyme strictosidine synthase (STR) 
(EC 4.3.3.2) to form strictosidine, which is the general precursor of all terpenoid indole 
alkaloids found in plants. The first enzymatic conversion of strictosidine into strictosidine 
aglucon is catalysed by strictosidine-B-D-glucosidase (SGD) (EC 3.2.1.105). Many 

25 different enzyme activities convert strictosidine aglucon into the large variety of terpenoid 
indole alkaloid end products. 

The best progress on molecular characterisation of the terpenoid indole alkaloid 
pathway has been made with C roseus or Madagascar periwinkle, a member of the 
Apocynaceae family. C roseus cells have the genetic potential to synthesise over a 

30 hundred terpenoid indole alkaloids. The biosynthesis of terpenoid indole alkaloids is 
strongly regulated, and depends on plant cell type and environmental conditions. Their 
biosynthesis is e.g. induced by fungal elicitors, jasmonates and auxin starvation. Many 
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monomeric TIA compounds are found in ail plant organs, but vindoline and vindoline- 
derived dimeric alkaloids are only found in chioroplast-containing plant tissues. 

Until now, only a limited number of genes coding for enzymes involved in 
terpenoid indole alkaloid biosynthesis have been cloned, such as cDNA clones encoding 
5 STR and TDC isolated from C roseus. In addition, cDNA clones for NADPH cytochrome 
P450 reductase (CPR), which is essential for the GlOH-catalysed reaction, and SGD have 
been isolated. 

Gene expression studies by the present inventors have shown that the regulation of 
terpenoid indole alkaloid biosynthesis is controlled largely, if not uniquely, at the level of 

10 the expression of biosynthetic genes. Accordingly, analysis of the expression of the 

terpenoid indole alkaloid biosynthetic genes Tdc and Str\ showed that their expression is 
low, especially in cell cultures. It was found, that the level of gene expression is likely to 
be limiting for alkaloid production. Overexpression of a single biosynthetic gene {Tdc or 
Str\) in transgenic C. roseus cells resulted in significantly elevated levels of the 

15 corresponding enzyme activity. However, this did not result in elevated alkaloid levels, 

presumably because many other enzymes are limiting and would need to be overexpressed. 

These studies have further demonstrated what is already known, that the genes are 
coordinately regulated depending on cell type or environmental conditions. StrX and Tdc 
mRNA accumulate in suspension-cultured cells after auxin starvation or phosphate 

20 starvation, exposure to fungal elicitors or (methyl) jasmonate and their distribution in the 
plant is developmentally regulated with the highest levels in the roots. In leaves Tdc and 
Str \ are induced by a UV-B light pulse, Cpr mRNA accumulation is rapidly induced by 
fungal elicitor. 

The observations that mRNA from genes involved in the biosynthesis of TIA 
25 compounds such as Tdc and Strl mRNAs coordinately accumulate in response to fungal 
elicitors, (methyl)-jasmonate, UV light, auxin depletion, phosphate depletion, and have 
similar tissue-specific distributions, have led the present inventors to hypothesise that the 
Tdc and Strl genes might be controlled by one or more common regulating factor(s) or 
substance(s). It was further hypothesised that among possible regulating factors, 
30 transcription factors could have such a regulating effect. 

It is known that certain transcription factors can regulate complex cell 
differentiation processes in animals involving numerous target genes. A notable example is 
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muscle differentiation, where either one of a set of myogenic bHLH transcription factors 
(MyoD, myogenin, Myf5, MRF4) in combination with the MADS-domain transcription 
factor MEF2 induces muscle cell differentiation and switches on numerous muscle-specific 
genes. Other examples include homeodomain transcription factors in the fruit fly that 
regulate cell processes resulting in the determination of segment identity. 

The pathway that is most extensively studied in plant secondary metabolism at the 
transcriptional level, is the one leading to the formation of the anthocyanin pigments. The 
genes encoding flavonoid biosynthetic enzymes are controlled by a combination of two 
distinct transcription factor species, one of which has homology to the protein encoded by 
the vertebrate proto-oncogene c-Myb, and the other with the vertebrate bHLH protein 
encoded by the proto-oncogene c-Myc. These transcription factors bind to specific 
sequences in the promoters of the target genes. 

The DNA-binding domain of plant MYB proteins consists of two, or for some of 
them, one imperfect repeat(s). The MYC proteins have a bHLH-type (basic helix-loop- 
15 helix) DNA-binding domain and recognise variants of the sequence CANNTG. About ten 
enzymes are involved in the biosynthesis of anthocyanins starting from phenylalanine. In 
maize, the entire set of genes encoding these enzymes are thought to be regulated 
coordinately by the Myc gene R and the Myb gene CI in the aleurone (epidermal layer of 
the kernel endosperm), and by homologous genes in other parts of the plant. 
20 Overexpression of the maize Lc gene encoding a MYC-type regulatory protein in 

Petunia upregulated the whole flavonoid biosynthetic pathway starting from Chs and 
including the earlier and later genes, resuhing among others in intensely pigmented leaves. 
The expression of the general phenylpropanoid genes Pal and C4h was not affected by Lc 
overexpression, indicating that Lc only regulates structural genes in the flavonoid branch. 
25 Introduction of the maize R and CI MYC-type regulators in Arabidopsis intensified 

pigmentation in normally pigmented tissues and induced pigmentation in plant tissues that 
are normally unpigmented. In maize cell suspension, ectopic expression of C/ and R led to 
the accumulation of anthocyanins and a number of other related 3-hydroxy flavonoids. In 
addition, six anthocyanin structural genes that are targets for Cl/R were expressed at high 
30 levels in the transgenic cell line. 

In the plant Arabidopsis thaliana, it has been shown that overexpression of the 
transcription factor CBF-1 that belongs to the AP2 domain class transcription factors 
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resulted in coordinate upregulation of a set of cold-regulated genes. However, there are no 
suggestions that transcription factors of this class may have an effect on the biosynthesis of 
metabolites in plant cells and other cells. 

The present inventors have now discovered that transcription factors having an 

5 AP2 DNA-binding domain are highly useful as central regulators of complex metabolite 
pathways involving numerous target genes for such transcription factors. This discovery 
has opened up for providing novel effective means of generating novel metabolite 
compounds, significantly enhancing the yield of commercially valuable metabolite 
compounds and also for enhancing the tolerance of plants towards exogenous stress factors 

0 and conditions. 

Besides such activation of metabolic pathways for the production of metabolites, it 
is also envisaged that the method of the invention may be used for such purposes as: 

- study of (plant) metabolism, in which the method of the invention may for instance be 
used to activate the metabolic pathway(s) under investigation; 

5 - study of gene expression, including but not limited to expression profiling, in that the 
method of the invention allows (the simultaneous) overexpression of one or more 
genes involved in one or more metabolic pathway(s), the expression or gene products 
of which may then be studied/determined; 

- to determine whether one or more specific (usually secondary) metabolites are formed 
in a plant, by enhancing the levels of said metabolites present in and/or formed by the 
plant (e.g. from a level below the detection limit of the assay/equipment used to a level 
above said limit); 

- lowering or removal of unwanted or toxic metabolites present in a plant or plant 
material; 

- discovery enzymes or regulators thereof involved in primary or secundary metabolism. 
Further applications will become clear to the skilled person on the basis of the 

description given hereinbelow, 

SUMMARY OF THE INVENTION 

Accordingly, the invention relates in one aspect to a method of modulating in a cell 
the expression of one or more genes involved in the biosynthesis of a metabolite or a 
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precursor herefor, the method comprising inserting into the cell a nucleotide sequence 
coding for a transcription factor other than a transcription factor having a bHLH-type or a 
MYB-type DNA-binding domain, the nucleotide sequence is operably linked to at least 
one expression regulating sequence, and/or modifying the expression of a nucleotide 
sequence coding for such a transcription factor already present in the cell, the transcription 
factor is capable of regulating the expression of at least one gene coding for a gene product 
involved in the biosynthesis of said metabolite or precursor, and subjecting the cell to 
conditions where the inserted nucleotide sequence coding for a transcription factor is 
expressed or the expression of the nucleotide sequence already present in the cell is 
modulated. 

In a further aspect the invention relates to a method of modulating the stress 
resistance of a cell, the method comprising inserting into the cell a nucleotide sequence 
coding for a transcription factor, the nucleotide sequence is operably linJced to at least one 
expression regulating sequence, and/or modifying the expression of a nucleotide sequence 
coding for such a transcription factor already present in the cell, the transcription factor is 
capable of regulating the expression of at least one gene coding for a gene product 
involved in the biosynthesis of said metabolite or precursor, and subjecting the cell to 
conditions where the inserted nucleotide sequence coding for a transcription factor is 
expressed or the expression of the nucleotide sequence already present in the ceil is 
modulated, the expression of said nucleotide sequence resulting in a modified 
responsiveness of the cell towards exogenous stress conditions. 

There is also provided a recombinant cell having, relative to its parent cell, 
enhanced or reduced biosynthesis of a metabolite or a precursor therefor, and/or enhanced 
or reduced expression of a gene product involved in metabolite production, the cell 
comprising a nucleotide sequence coding for a transcription factor other than a 
transcription factor having a bHLH-type or a MYB-type DNA-binding domain, said 
transcription factor is capable of regulating the expression of at least one gene coding for a 
gene product involved in the metabolite production, said sequence is inserted into the cell 
and/or its expression is modified by operably linking it to a regulating sequence with 
which it is not natively associated. 

The invention pertains in a still further aspect to a method of producing a 
metabolite including a plant secondary metabolite, the method comprising providing a 
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recombinant cell according to the invention, cultivating the ceil under conditions where the 
nucleotide sequence coding for the transcription factor regulating the expression of at least 
one gene coding for a gene product involved in the metabolite production is expressed, and 
recovering the metabolite. 
5 There is also provided a method of constructing a recombinant cell according to the 

invention, the method comprising the steps of 

(i) identifying in a source cell a nucleotide sequence coding for a transcription factor 
other than a transcription factor having a bHLH-type or a MYB-type DNA-binding 
domain, said transcription factor is capable of regulating in the source cell the 

10 expression of at least one gene coding for a gene product involved in biosynthesis 

of a metabolite or a precursor therefor, 

(ii) isolating said nucleotide sequence; and 

(iii) inserting said isolated nucleotide sequence into a host cell comprising a gene 
coding for a gene product involved in biosynthesis of a metabolite or a precursor 

15 therefor, the insertion of said nucleotide sequence resulting in that the expression of 

the gene of the host cell is modulated relative to the expression level in a host cell 
not comprising the inserted nucleotide sequence. 

In still further aspects there are provided an isolated nucleic acid molecule 
comprising a nucleotide sequence coding for an amino acid sequence selected from the 

20 group consisting of SEQ ID NO:4, SEQ ID NO:5 and SEQ ID NO:6, a polypeptide which 
includes an amino acid sequence having at least 40% homology including at least 50%, 
such as at least 60%, at least 70%, at least 80% and at least 90%> to an amino acid sequence 
selected from the group consisting of SEQ ID NO:4, SEQ ID NO:5 and SEQ ID NO:6 and 
a polypeptide having the amino acid sequence shown in SEQ ID NO:4, SEQ ID NO:5 or 

25 SEQ ID NO:6. 

In other aspects, the invention pertains to a vector comprising a nucleic acid 
molecule according to the invention, a host cell comprising such a vector, and a method of 
isolating a nucleotide sequence coding for a transcription factor not having a bHLH-type 
or a MYB-type DNA-binding domain, said transcription factor is capable of regulating in a 

30 cell the expression of at least one gene coding for a gene product involved in biosynthesis 
of a metabolite or a precursor therefor, the method comprising the steps of 
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(i) probing a cDNA or genomic library with a probe comprising at least a fragment of 
nucleic acid selected from the group consisting of SEQ ID NO:l, SEQ ID NO:2 
and SEQ ID NO:3 and/or a nucleic acid coding for an amino acid sequence selected 
from the group consisting of SEQ ID NO:4, SEQ ID NO:5 and SEQ ID NO:6 
5 (ii) identifying a DNA clone that hybridises with said nucleic acid; and 

(iii) isolating the DNA clone identified in step (ii) wherein the nucleic acid sequence is 
coding for all or a part of said transcription factor 

The invention also relates to a method of isolating a nucleotide sequence coding for 
a transcription factor not having a bHLH-type or a MYB-type DNA-binding doma.n. said 
,0 transcription factor is capable of regulating in a cell the expression of at least one gene 
coding for a gene product involved in biosynthesis of a metabolite or a precursor therefor, 
the method comprising the steps of 

(i) synthesising an oligonucleotide primer set corresponding to at least a fragment of a 
sequence selected from the group consisting of SEQ ID NO: 1, 2 and 3; and 
,5 (ii) amplifying cDN A or genomic DNA using said primer set in a polymerase chain 
reaction wherein the amplified nucleic acid fragment is coding for all or a part of 
said transcription factor to obtain a nucleotide sequence coding for a transcription 
factor not having a bHLH-type or a MYB-type DNA-binding domain. 
The invention also pertains to an isolated nucleic acid which is a nucleotide 
20 sequence as shown in SEQ ID NO: 7 and a mutant, allele or variant hereof and the use of 
this isolated nucleic acid sequence for isolating a nucleotide sequence coding for a 
transcription factor not having a bHLH-type or a MYB-type DNA-binding domain, said 
transcription factor is capable of regulating in a cell the expression of at least one gene 
coding for a gene product involved in biosynthesis of a metabolite or a precursor therefor. 

25 

PREFERRED ASPECTS AND EMBODIMENTS OF THE INVENTION 

Hereinbelow, some preferred but non-limiting aspects and embodiments of the 
invention will be discussed in more detail. 
30 In a first aspect, the invention relates to a method of modulating in a cell the 

level(s) of one or more metabolites, said method comprising providing to the cell at least 
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one transcription factor other than a transcription factor having a bHLH-type or a M YB- 
type DNA-binding domain. 

In another aspect, the invention relates to a method of modulating in a cell the 
expression of one or more genes involved in the biosynthesis of a metabolite or a precursor 
5 herefor, said method comprising providing to the cell at least one transcription factor other 
than a transcription factor having a bHLH-type or a MYB-type DNA-binding domain. 

Also, in yet another aspect, the invention relates to a method of modulating the 
stress resistance of a cell, said method comprising providing to the cell at least one 
transcription factor other than a transcription factor having a bHLH-type or a MYB-type 
10 DNA-binding domain. 

In the above methods, at least one transcription factor is preferably provided to the 
cell by the expression in said cell, under the control of an expression regulating sequence 
operable in said cell, of at least one nucleotide sequence that encodes said at least one 
transcription factor. In particular, this may be achieved by the the steps of: 
15 (a) transforming the cell with a genetic construct, said construct comprising the at least 

one nucleotide sequence encoding the at least one transcription factor, operably linked 
to said expression regulating sequence; 
(b) maintaining the cell under conditions such that the at least one nucleotide sequence 
encoding the at least one transcription factor is expressed in said cell. 
20 Usually the expression regulating sequence used in this method will be 

heterologous to the cell and/or will be an expression regulating sequence with which the at 
least one nucleotide sequence that encodes said at least one transcription factor is not 
natively associated, although in its broadest sense, the invention is not limited thereto. 
Generally, by providing a transcription factor to a cell in accordance with the 
25 method of the invention, the level in the cell of the one or more metabolites, including 

primary metabolites, secondary metabolites or precursors and/or intermediates therefor, is 
enhanced or reduced, e.g. relative to a cell to which said transcription factor is not 
provided. 

Providing a transcription factor to a cell in accordance with the method may also 
30 lead to enhanced or reduced levels of enzymes or enzyme regulating factors involved in 
primary or secundary metabolism. 
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In particular, by the method of the invention, the level in the cell of the one or more 
metabohtes may be enhanced by at least 10%, at least 25% or at least 100%, e.g. relative to 
a cell to which the transcription factor is not provided. 

Alternatively, by the method of the invention, the level in the cell of the one or 
5 more metabolites may be reduced by at least 10%, at least 25%>, at least 50%, or at least 
95%, e.g. relative to a cell to which the transcription factor is not provided 

Also, by the method of the invention, the level of one or more metabolites may be 
enhanced (e.g. by the amounts indicated above), while at the same time, the level of one or 
more other metabolites may be reduced (e.g. by the amounts indicated above). 
,0 Also, by providing a transcription factor to a cell in accordance with the method of 

the invention, the expression in the cell of one or more genes involved in the biosynthesis 
of a metabolite or a precursor herefor is enhanced or reduced, e.g. relative to a cell to 
which the transcription factor is not provided. 

In particular, by the method of the invention, the expression in the cell of one or 
15 more genes involved in the biosynthesis of a metabolite or a precursor herefor may be 
enhanced by at least 10%>, at least 25% or at least 100%, relative to a cell to which the 
transcription factor is not provided. 

Alternatively, by the method of the invention, the expression in the cell of one or 
more genes involved in the biosynthesis of a metabolite or a precursor herefor may be 
20 reduced by at least 1 0%, at least 25% or at least 50%, or at least 95%, relative to a cell to 
which the transcription factor is not provided. 

Also, by the method of the invention, the expression in the cell of one or more 
genes involved in the biosynthesis of a metabolite or a precursor herefor may be enhanced 
(e.g. by the amounts indicated above), while at the same time, the expression in the cell of 
25 one or more other genes involved in the biosynthesis of a metabolite or a precursor herefor 
may be reduced (e.g. by the amounts indicated above). 

The transciption factor used in the invention is preferably a factor of the 
"AP2/EREBP"-family. Generally, these can be defined as transcription factors that contain 
within their amino acid sequence one or more AP2 DNA-binding domains or -regions 
30 (hereinbelow generally referred to as ''AP2 domains'^). 

Such AP2 domains are known per se, for instance from Riechmann J.L., 
Meyerowitz E.M. (1998), Biological Chemistry 379: 633-646; and one or more of the AP2 
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domains described in this reference may be present in the transcription factors used in the 
invention. 

In addition to - or instead of - one or more of the AP2 domains described in the 
Riechmann and Meyerowitz reference, the transcription factors used in the invention may 
5 also contain one or more other AP2 domains. These may be other naturally occuring AP2 
domains or may be mutants, analogs, variants, parts or fragments of any naturally occuring 
AP2 domain, including but not limited to mutants, analogs, variants, parts or fragments of 
the AP2 domains described in the Reichmann and Meyerowitz reference. Such mutants, 
analogs, variants, parts or fragments may be derived from the amino acid sequence of a 
10 naturally occuring AP2 domain by addition, substitution, insertion and/or deletion of one 
or more amino acid residues in the naturally occuring sequence, e.g. as further described 
below. 

Preferably, any such AP2 domain (including mutants, analogs, variants, parts or 
fragments) will have at least 35% amino acid residue identity with at least one of the AP2 
5 domains described by Riechmann and Meyerowitz, such as at least 45%, at least 55%, at 
least 65%, at least 75%), at least 85%, at least 95%) or at least 98%) residue identity. For this 
purpose, the percentage of ""amino acid residu identity of a given sequence to the 
sequence of an AP2 domain described by Reichmann and Meyerowitz may be calculated 
by: '''[the number of amino acid residues in the given sequence that correspond to the 
amino acid residue on the same position in the sequence of the AP2 domain] divided by 
[the total number of amino acid residues in the sequence of the AP2 domain] times 100%"% 
in which an insertion and/or a deletion of an amino acid residue is taken as a difference at a 
single amino acid residue/position. All AP2 domains, including but not limited to mutants, 
analogs, variants, parts or fragments of naturally occuring AP2 domains, that fall within 
this general definition should be considered encompassed within the term ""API domain' 
as used herein. 

As mentioned in the Reichmann and Meyerowitz reference, other suitable AP2 
domains may be easily identified through BLAST searches due to the high degree of 
conservation of the sequence of the AP2 domain. After a suitable AP2 domain has been 
identified, (a nucleotide sequence encoding) said domain may then for instance be isolated 
from its natural source, optionally as part of (a nucleotide sequence encoding) a full 
naturally occuring transcription factor. Also, (nucleotide sequences encoding) mutants. 
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analogs, variants, parts or fragments of such naturally occuring AP2 domains may for 
instance be provided starting from such naturally occurring sequences, e.g. as further 
described below. 

Any AP2 domain(s) used in the invention will generally contain the features of the 
5 AP2 domains as summarized by Riechmann and Meyerowitz (vide page 635, right hand 
column), for which reference is also made to Okamuro et al, Proc.Natl.Acad.Sci.USA, 94, 
7076-7081, both incorporated herein by reference. For instance, any amino acid sequence 
having similarity to any of the two conserved segments of the AP2 domains as defined by 
Riechmann and Meyerowitz - i.e. the YRG element and the RA YD element - could also be 
0 used in the invention. 

Preferably, the AP2 domain(s) present in the transcription factor(s) used in the 
invention is/are selected from the AP2 domains of SEQ ID NO:4, SEQ ID NO:5 or SEQ 
ID NO:6; and/or from amino acid sequences having at least 40% homology, at least 50%, 
at least 60%, at least 70%, at least 80% or at least 90% homology to one of the AP2 
5 domains of SEQ ID NO:4, SEQ ID NO:5 and SEQ ID NO:6. 

As used herein, the term ^'homology'' refers to homology at the amino acid level 
both in terms of amino acid identity and similarity. Similarity allows for ''conservative 
variation'\ i.e. substitution of one hydrophobic residue such as isoleucine, valine, leucine 
or methionine with another hydrophobic residue, or the substitution of one polar residue 
with another polar residue, such as arginine with lysine, glutamic acid with aspartic acid, 
or glutamine with asparagine. In the present context, similarity is defined and determined 
using the present version of the BLAST program (Altschul et al. Nucleic Acids Res 
25:3389-3402, 1997). 

The transcription factor(s) used in the invention may contain a single AP2-domain, 
two AP2 domains, or even more than two AP2 -domains. As such, they may for instance 
contain one or more of the AP2 domains described in the Reichmann and Meyerowitz 
reference; one or more other naturally occuring AP2 domains; one or more mutants, 
analogs, variants, parts or fragments of a naturally occuring AP2 domain; or any 
combination thereof Combined, these transcription factors containing one or more AP2 
domains will be referred to hereinbelow as '^AP2 transcription factors'\ 

Besides the one or more AP2 domains, the AP2 transcription factors used in the 
invention preferably also contain one or more of the other elements known per se for 



wo 00/46383 



14 



PCT/NLOO/00075 



transcription factors, and in particular one or more of the other elements known per se for 
AP2 transcription factors, e.g. as described in the Reichmann and Meyerowitz reference. 
These include, but are not limited to, one or more transcription activation domains 
( usually regions of biased amino acid composition such as acidic-rich, glutamine-rich, 
5 proline-rich, and/or serine/threonine-rich regions) and - if more them one AP2 domain is 
present - one or more linking regions or -sequences between the AP2 domains. 

The AP2 transcription factor used in the invention may be a naturally occuring AP2 
transcription factor or may be a "synthetic"AP2 transcription factor, e.g. as described 
below. 

10 Naturally occuring AP2 transcription factors suitable for use in the invention 

include, but are not limited to, the tramscription factors described in the Reichmann and 
Meyerowitz reference or other naturally occuring AP2 transcription factors known per se, 
As mentioned in the Reichmann and Meyerowitz reference, such other AP2 transcription 
factors may be easily identified through BLAST searches due to the high degree of 

15 conservation of the sequence of the AP2 domain(s). 

When a naturally occuring AP2 transcription factor is used, it may be derived from 
any suitable biological source, which will usually be a plant, and in particular 
dicotyledonous plant; and some preferred, but non-limiting sources will be mentioned 
hereinbelow. 

20 Also, when a naturally occuring AP2 transcription factor is used, it may have been 

derived from essentially the same plant as the plant (cell) to which said transcription factor 
is to be provided according to the method of this invention, e.g. from a plant belonging to 
the same variety, species or genus. However, the invention is not limited thereto and in its 
broadest sense, any AP2-transcription factor or nucleotide sequence coding therefor may 

25 be used, irrespective of its origin. 

Preferably, the AP2 transcription factor used is such that it causes, in the cell to 
which it is provided/in which it is expressed: 

- an enhancement of the level(s) in the cell of one or more metabolites, a reduction of the 
level(s) in the cell of one or more (other) metabolites; or a combination thereof, e.g. as 

30 decribed above and/or by the amounts indicated above; and/or 

- an enhancement of the expression in the cell of one or more genes involved in the 
biosynthesis of a metabolite or a precursor herefor, a reduction of the expression in the 
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cell of one or more genes involved in the biosynthesis of a metabolite or a precursor 
herefor; or a combination thereof, e.g. as decribed above and/or by the amounts 
indicated above. 

As mentioned in the Riechmann and Meyerowitz reference, in nature AP2 
transcription factors - i.e. with one, two or more than two AP2 domains - may for 
instance be involved in the environmental (light, temperature, etc.) and/or stress 
response(s) by a plant (cell); in the growth and/or development of a plant, and similar 
biological responses in or by a plant. 

Thus, according to one particular embodiment, the transcription factor may be a 
naturally occuring transcription factor, and in particular a naturally occuring AP2 
transcription factor, that is involved in the (intracellular) response of a plant (cell) to 
external stress, including but not limited to response(s) to a pathogen (e.g. bacteria, 
viruses, fungi); to physical damage including but not limited to damage caused by 
herbivores, insects, worms and other multicellular organisms; to lack of (sufficient) water; 
to lack of (sufficient) oxygen; to lack of one or more nutrients; to lack of (sufficient) light; 
to cold; to heat; to high level(s) of salt(s); to UV; to ozone; to xenobiotics; etc.. 

Usually, these will be transcription factors containing a single AP2 domain, 
although the invention in its broadest sense is not limited thereto. Also, mutants, analogs, 
variants, parts or fragments etc. (e.g. as described below) of such naturally occuring 
20 transcription factors may be used. 

Generally, the response(s) of a plant to such external stress will be mediated by one 
or more stress-related signalling molecules including but not limited to jasmonates, 
ethylene, salicylates, abscisic acid, gibberelic acids, nitrogen oxide, hydrogen peroxide, 
reactive oxygen species, fluctuations in imracellular calcium or other ion concentrations, 
25 phospholipids, phosphoinositides, cyclic ADP-ribose. 

Thus.Jn a more particular embodiment, the transcription factor may be a naturally 
occuring transcription factor, and in particular a naturally occuring AP2 transcription 
factor, that is involved in the (intracellular) response of a plant (cell) to a stress related 
signalling molecule; and more in to particular may be a naturally occuring transcription 
factor that is involved in the (intracellular) response of a plant (cell) to a jasmonate, to a 
salicylate, to ethylene, to abscisic acid, gibberelic acids, nitrogen oxide, hydrogen 
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peroxide, reactive oxygen species, gsimma-amino-butyric-acid, fluctuations in intracellular 
calcium or other ion concentrations, phospholipids, phosphoinositides, cyclic ADP-ribose. 

Again, these will usually be transcription factors containing a single AP2 domain, 
although the invention in its broadest sense is not limited thereto; and again mutants, 
5 analogs, variants, parts or fragments etc. (e.g. as described below) of such naturally 
occuring transcription factors may also be used. 

Preferably, the transcription factor is a naturally occuring transcription factor, and 
in particular a naturally occuring AP2 transcription factor, that is involved in the 
intracellular response of a plant (cell) to a jasmonate. [In this respect, it is to be understood 
10 that the term ''jasmonate'' does not only include jasmonate/jasmonic acid, but also 
precursors thereof and other jasmonate-like signalling molecules such as methyl 
jasmonate, 12-oxo-PDA and other lipid-derived plant signal molecules]. Again, these 
transcription factors will usually be AP2 transcription factors containing a single AP2 
domain, although the invention in its broadest sense is not limited thereto; and again 
15 mutants, analogs, variants, parts or fragments etc. (e.g. as described below) of such 
naturally occuring transcription factors rriay also be used. 

Some specific, non-limiting examples of such naturally occuring AP2 transcription 
factors suitable for use in the invention include ORCAl, ORCA2, ORG A3, AtERFl, 
AtEBP, AtCBFl, AtDREBl, AtDREB2A, AtDREB2B, TINY, NtEREBP 1,2,3,4, 
20 LePti4,5,6, ABM. 

Of these, the following transcription factors are preferred AtERFl, AtEBP, 
AtCBFl, AtDREBl, NtEREBP 1,2,3,4, LePti4,5,6, ORCAl, ORCA-2 and ORCA3, of 
which ORCA-2 and ORCA3 are most preferred. 

Also, in the invention, naturally occuring transcription factors, and in particular 
25 naturally occuring AP2 transcription factors, may be used that are involved in the growth 
and/or development of a plant, such as the growth and/or development of the flower, of 
seed, etc.. Usually, these will be AP2 transcription factors containing two AP2 domains, 
although the invention in its broadest sense is not limited thereto. Also, again mutants, 
analogs, variants, parts or fragments etc. (e.g. as described below) of such naturally 
30 occuring transcription factors may also be used. 

Some non-limiting examples of such transcription factors involved in plant 
development include: APETALA2, AINTEGUMENTA, TINY. 
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The term "'synthetic"" API transcription factor as used herein generally comprises 
any transcription factor that contains at least one AP2 domain as defined above and that 
does not occur in nature. These include, but are not limited to; 

- transcription factors that contain one or more mutants, analogs, variants, parts or 

5 fragments of a naturally occuring AP2 domain, optionally in combination with one or 

more naturally occuring AP2 domains; 

- AP2 transcription factors that contain at least two naturally occuring AP2 domains 
which in nature are not natively associated; 

- transcription factors that comprise - compared to the sequence of a naturally occuring 
10 AP2 transcription factor- one or more amino acid insertions, deletions and/or 

substitutions at one or more amino acid positions outside the AP2 domain(s) present 
therein, such as in the transcription activation domain(s) and/or in the linking 
regions/sequences; 

- transcription factors in which one or more AP2 domains are combined with one or 
15 more transcription activation domains and/or with one or more linking sequences 

(where appropriate) which which in nature they are not natively associated; 
or any combination thereof. 

According to one embodiment, such synthetic AP2 transcription factors may be 
mutants, analogs, variants, parts or fragments of the abovementioned naturally occuring 
20 AP2 transcription factors. Such mutants, analogs, variants, parts or fragments may for 
instance be derived from the amino acid sequence of such a naturally occuring AP2 
transcription factor by addition, substitution, insertion and/or deletion of one or more 
amino acid residues, e.g. in the part(s) of the sequence of the transcription factor 
corresponding to the AP2 domain(s); in one of the other parts of the sequence of the 
25 transcription factor sequence such as the transcription activation domain(s) and/or linking 
sequences; or in both. 

Although generally, the parts or segments of the AP2 transcription factor other than 
the AP2 domain(s) will usually not be critical; however, any synthetic AP2 transcription 
factor used in the invention may have at least 20% amino acid residue identity with at least 
30 one of the naturally occuring AP2-transcription factors, such as at least 35%, at least 45%, 
at least 55%, at least 65%, at least 75%, at least 85%, at least 95% or at least 98% residue 
identity. For this purpose, the precentage of '"amino acid residue identity"' of a synthetic 
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sequence to the sequence of a naturally occuring AP2 transcription factor may be 
calculated by dividing "[the number of amino acid residues in the given sequence thai 
correspond to the amino acid residu on the same position in the sequence of the AP2 
transcription factor] by [the total number of amino acid residues in the sequence of the 
5 AP2 transcription factor] x 100%", in which a substitution, an insertion and/or a deletion 
of an amino acid residu is taken as a difference at a single amino acid position. 

Any "synthetic"AP2 transcription factor, including but not limited to mutants, 
analogs, variants, parts or fragments of naturally occuring AP2 transcription factors, that 
fallswithin this general definition should be considered encompassed within the term 
10 transcription factor " as used herein. However, any such synthetic AP2 transcription factor 
is preferably such that it causes, in the cell to which it is provided/in which it is expressed: 
- an enhancement of the level(s) in the cell of one or more metabolites, a reduction of the 
level(s) in the cell of one or more (other) metabolites; or a combination thereof, e.g. as 
decribed above and/or by the amounts indicated above; and/or 
15 - an enhancement of the expression in the cell of one or more genes involved in the 

biosynthesis of a metabolite or a precursor herefor, a reduction of the expression in the 
cell of one or more genes involved in the biosynthesis of a metabolite or a precursor 
herefor; or a combination thereof, e.g. as decribed above and/or by the amounts 
indicated above. 

20 In this respect, it will be clear to the skilled person that a transcription factor which 

can provide such enhancement and/or reduction in one (type oO plant cell may not 
necessarily provide the same level of (or even any) such enhancement or reduction in 
another (type oO plant cell, depending upon the transcription factor and plant cell used. 
Also, it may be that the same transcription factor may enhance or reduce different 

25 metabolites and/or genes in different plant cells, and may even enhance a metabolite or 
gene in one cell but reduce it in another. All this will generally be incorporated within the 
scope of the invention, as long as the transcription factor used is capable of providing the 
enhancement or reduction described above in at least the intended plant cell. 

Usually, according to the invention, an AP2 transcription factor as described above 

30 will be provided to said intended plant (cell) by means expression in said plant (cell) of a 
nucleotide sequence encoding said AP2 transcription factor. 
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Preferably, such a nucleotide sequence encoding the AP2 transcription factor is 
provided to the plant (cell) in the form of a genetic construct, in which the sequence 
encoding the AP2 transcription factor is operably linked (as defined below) to a suitable 
promoter (e.g. as described below). Such constructs, which are a further aspect of the 
5 present invention, may be in a form suitable for transforming a plant, such as a vector or 
plasmid, and may contain all other elements of genetic constructs for the transformation of 
plants known per se, e.g. those described below. 

Upon transformation of the plant (cell) with such a construct, the AP2 transcription 
factor may then be expressed in the plant (cell), e.g. by keeping the plant under conditions 
10 suitable for expression, which will usually depend upon the promoter used, and which may 
for instance include exposure of the plant (cell) to a suitable inducing factor or compound. 

Upon such expression, the AP2 transcription factor may be subject to one or more 
post-translational modifications, e.g. those that may occur in the plant (cell) in which the 
AP2 transcription factor is expressed. Thus, the term "AP2 transcription factor'' as used 
15 herein does not only comprise any amino acid sequence directly obtained upon expression 
of the nucleotide sequence encoding an AP2 transcription factor, but also all derivatives 
thereof obtained through one or more post-translational modifications. 

For preparing the genetic constructs to be used in the invention, nucleotide 
sequences encoding naturally occuring AP2 transcription factors and/or AP2 domains may 
20 be derived from a suitable source, e.g. those described above for the AP2 transcription 
factors themselves. 

Nucleotide sequences encoding "synthetic"AP2 transcription factors (as described 
above) for use in the invention may for instance be provided by insertion, deletion, 
addition or substitution of one or more codons - or even insertion, deletion, addition or 

25 substitution, and in particular substitution, of one or more bases/nucleotides - in or 

compared to a naturally occuring sequence of an AP2 transcription factor; by combining 
two or more parts of two or more different naturally occuring AP2 domains or AP2 
transcription factors; and/or by combining one or more parts of one or more different 
naturally occuring AP2 domains or AP2 transcription factors with one or more synthetic 

30 nucleotide sequences; or any combination thereof. This may for instance be carried out 
starting from one or more such naturally occuring sequences (or one or more parts 
thereoO, e.g. by using well known techniques of genetic manipulation, including but not 
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limited to site-directed mutaganesis, introducing mutations using mismatched PCR 
primers, as well as cutting and ligating of nucleic acid sequences. Alternatively, such 
sequences or parts thereof may be prepared using well-known DNA-synthesis techniques, 
although this is usually not preferred. 
5 The synthetic AP2 transcription factor encoded by said sequence may then be 

provided to the plant (cell) by transformation and expression - e.g. essentially as described 
above - again optionally followed by one or more post-translational modifications. 

Suitable techniques for providing the abovementioned nucleotide sequences and 
genetic constructs, as well as techniques for the transformation of plants using such 

10 constructs, will be well known to the skilled person, and some examples thereof are 

mentioned below. Reference is also made to the standard handbooks, such as Sambrook et 
al, "Molecular Cloning: A Laboratory Manual" ( 2nd.ed.), Vols. 1-3, Cold Spring Harbor 
Laboratory (1989) of F. Ausubel et al, eds., "Current protocols in molecular biology", 
Green Publishing and Wiley Interscience, New York (1987). 

15 One well-known technique for transformation of plants involves the use of 

Agrobacterium strains and vector systems for use therewith. Thus, one aspect of the 
invention comprises an Agrobacterium strain containing a genetic construct/vector as 
described herein, as well as the use thereof in transforming a plant or plant cell. 

In the invention, it was found that the expression in a plant (cell) of a naturally 

20 occuring or synthetic AP2 transcription factor - e.g. using a genetic construct as described 
herein - may be used to regulate one or more metabolic pathways in the plant. In 
particular, it was found that such expression of an AP2 transcription factor may be used to 
regulate simultaneously several of such metabolic pathways. These include metabolic 
pathways leading to one or more primary metabolites, pathways leading to one or more 

25 secondary metabolites; pathways leading to and/or from one or more key intermediates of 
the plant metabolism and/or precursors of primary and/or secondaty metabolites; or any 
combination thereof. 

In particular, it was found that the expression in plant (cell) of an AP2 transcription 
factor in accordance with the invention may be used to regulate one or more metabolic 
30 pathways leading to one or more secondary metabolites, including but not limited to those 
mentioned hereinabove and below. 
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Such regulation of the metabolic pathway(s) may alter or otherwise influence the 
entire metabolic pathway(s) - e.g. compared to the native situation, e.g. without exposure 
to stress - but also one or more separate steps (e.g. enzymatic conversions) thereof, which 
may include enzymatic conversions belonging to the same or to different metabolic 
5 pathways. 

Furthermore, such regulaton may comprise an up-regulation of one or more of such 
enzymatic conversions; a down-regulation of one or more of such enzymatic conversions 
involved in said metabolic pathway; or any combination thereof 

Also, such regulation may result in increased formation and/or intracellular level of 

10 one or more metabolites ( including primary as well as secondary metabolites); to 

decreased formation and/or intracellular level of one or more metabolites ( again including 
primary as well as secondary metabolites); or any combination thereof Furthermore, such 
regulation may (also) result in increased formation and/or intracellular levels of one or 
more (key) intermediates of the plant metabolism and/or precursors for primary and/or 

1 5 secondary metabolites; in decreased increased formation and/or intracellular levels of one 
or more (key) intermediates of the plant metabolism and/or precursors for primary and/or 
secondary metabolites; or any combination thereof 

Generally, the number of enzymatic steps/conversions that are modulated in 
accordance with the invention will depend upon the biosynthetic route(s) involved, e.g. the 

20 biosynthetic route(s) leading to - or otherwise involved in the production of - the 

metabolite(s) the level of which is to be modulated. As such, one or more steps of any such 
route may be modulated, and up to all steps involved in the biosynthesis and/or the 
precursor pathways of the metabolite. 

Similarly, also the number of genes the expression of which is modulated in 

25 accordance with the invention may depend upon the metabolic pathway involved. 

According to one aspect of the invention, providing a transcription factor to a cell 
via the method of the invention simultaneously modulates at least two, preferably at least 
three, more preferably at least 5, for instance between 2 and 50, enzymatic conversions - 
e.g. starting from a given precursor - and up to to all enzymatic steps involved in the 

30 formation of the desired metabolite; in which said conversions may be part of one or more 
metabolic pathways in the cell. 
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Similarly, according to one aspect of the invention, providing a transcription factor 
to a cell via the method of the invention simultaneously modulates at least tv^o, preferably 
at least three, more preferably at least 5 genes encoding expression products - e.g. enzymes 
- involved in said metabolic pathways, and up to all genes involved in said metabolic 
5 pathway(s). 

Generally, the enzymatic conversions that may be regulated in accordance with the 
invention include those conversions that can be regulated by the presence/expression in the 
cell of an AP2 transcription factor. Generally, these will be conversions that are catalysed 
or otherwise mediated by polypeptides or proteins (e.g. enzymes) the expression of which 
10 may be regulated, altered or otherwise influenced - and in particular be enhanced - by the 
presence/expression in the cell of an AP2 transcription factor. Some non-limiting examples 
of such conversions are mentioned below. 

Often, whether the expression of such a protein or polypeptide may be influenced 
by an AP2 transcription factor will depend upon the genetic make up of the gene involved. 
15 and in particular also on the promoter involved - e.g. that precedes - said gene. In principle, 
the invention may be used to modulate, and in particular to enhance, the expression of any 
and all native nucleotide sequence(s) or gene(s) that is/are preceded by, operably linked to 
and/or under the control of an expression regulating sequence - e.g. promoter- that contains 
a binding region for the transcription factor provided, e.g. such that the transcription factor 
20 may modulate, and in particular enhance, the expression of said gene. 

However, the invention is not limited to such promoter-mediated modulation, but 
generally encompasses any mechanism via which a transcription factor may modulate the 
gene expression and/or the metabolism. For instance, this may involve binding to the DNA 
- e.g. genomic DNA - at other domains or regions usually close to (the part of the DNA 
25 encoding) the gene to be modulated, including but not limited to intron sequences, mRNA 
leader sequences, to regions of domains upstream - i.e. 3' compared to - the gene, etc.. The 
modulation may also involve binding of the transcription factor to the part of the DNA 
encoding the gene to be modulated itself. 

Some non-limiting examples of such expression regulation sequences/promoters 
30 that may be regulated by the transcription factors provided to the cell by the method of the 
invention include the promoters of the strictosidine synthase and tryptophan decarboxylase 
genes, as well as any other promoter of any other gene the expression of which is 
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modulated - e.g. increased - by (over)expression of ORCA2 and/or ORCA3, including but 
not limited to the promoters of the Cpr, GlOh, D4h, AS and/or DXS-genes. 

As such, the invention may be used to regulate the expression of all genes that in 
the cell are natively under the control of such an expression regulating sequence; but also 
of genes that are not natively associated with such an expression regulating sequence, but 
that have been brought under the control of such a sequence. The latter may for instance 
include genes or other nucleotide sequences - homologous or heterologous to the plant 
(cell) - that are operably linked to such an expression regulating sequence as part of a 
genetic construct used to transform the plant (cell). 

As further described below, the plant (cell) to which the (nucleotide sequence 
encoding) the AP2 transcription factor is provided - e.g. in which said factor is expressed 
in accordance with the method of the invention - may be present in or derived from any 
naturally occuring plant, including but not limited to any part, tissue or organ thereof. The 
plant cell may also be present in/derived from a transgenic plant.. 

For instance, the method of the invention may be applied to any plant - or part, 
tissue, organ or cell of such a plant - in which the metabolism is to be modulated, and in 
particular enhanced, e.g. for the (increased) production by the plant (cell) of one or more 
primary and/or secondary metabolites. 

These plants may include, but not limited to, plants or plant cells of agronomically 
20 important crops, such as maize, rice, potato, wheat, cotton, soybean, grapevine, tomato, 
tobacco, canola, pepper, diverse herbs such as oregano, basilicum, mint. 

It may also be applied to plants that produce valuable compounds, e.g. useful as for 
instance pharmaceuticals, as ajmalicine, vinblastine, vincristine, ajmaline, reserpine, 
rescinnamine, camptothecine, ellipticine, quinine, and quinidine, taxol, morphine, 
scopolamine, atropine, cocaine, sanguinarine, codeine, genistein, daidzein, digoxm, 
colchicine, or as food additives, such as anthocyanins, vanillin; including but not limited to 
the (classes of) compounds mentioned above. Examples of such plants include, but not 
limited to, Papaver spp, Rauvolfia spp, Taxus spp. Cinchona spp, Eschscholtzia 
califomica, Camptotheca acuminata, Hyoscyamus spp, Berberis spp, Coptis spp. Datura 
spp, Atropa spp, Thalictrum spp, Peganum spp. 

The method of the invention may also be used to provide the plant or any part, 
tissue or organ thereof - including but not limited to leaves, stalks, stems, roots, petals. 
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flowers, reproductive organs, as well as fruits, tubers, bulbs and/or seeds - with enhanced 
or otherwise modified taste, flavour, colour, scent or other desirable properties, e.g. due to 
increased levels of one or more metabolites that caused or or otherwise associated with 
said propetries. 

Furthermore, as already mentioned, the method of the invention may be used to 
improve the stress resistance of a plant or plant cell, e.g. against the stress factors 
mentioned above. In one embodiment of the invention, such increased stress resistance 
may (at lest in part) be the result of/be caused by increased levels of certain metabolites in 
the plant (cell). For example, increased levels of proline, certain sugars or other "anti- 
freeze" components may lead to improved resistance against cold, whereas increased 
levels of certain other secondary metabolite may lead to increased resistance against UV- 
radiation, etc.. 

Also, such increased levels of certain metabolites - and in particular of certain 
secondary metabolites - may make the plant or any part, tissue, organ or material thereof - 
including but not limited to leaves, stalks, stems, roots, petals, flowers, reproductive 
organs, as well as fruits, tubers, bulbs and/or seeds - more resistant against biological 
pathogens and similar stress factors, including but not limited to bacteria, nematodes, 
insects and/or other pests, and/or herbivores. As is known, certain secondary metabolites - 
e.g. toxic or bad tasting metabolites - may defend the plant or plant material against 
damage or attack by such pathogens, in that they may make the plant or plant material less 
attractive as a food source. According to the invention, the level(s) of such metabolites 
may be (further) increased - e.g. compared to the native levels in the plant - to reduce or 
even prevent attack by such pathogens. 

From the above, it will be clear that the method of the invention may more 
generally be used to enhance or otherwise influence any property of a plant that is caused 
by, or at least associated with, the levels of one or more secondary metabolites in the plant. 

The method of the invention may also be used to provide (increased amounts of) 
one or more of the enzymes involved in the metabolic pathways in the plant, which may 
then be isolated from the plant and then put to further use(s). 

According to the invention, the modulation of the metabolism - and thereby the 
enhanced production of the metabolites - may occur throughout the entire plant, or only in 
one or more specific parts, tissues, organs and or cells of the plant, including but not 
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limited to leaves, stalks, stems, roots, petals, flowers, reproductive organs, as well as fruits, 
tubers, bulbs and/or seeds. The modulation of the metabolism may also occur during a 
specific phase of the plant life and/or plant development. 

For this purpose, the transcription factor(s) of the invention may also be expressed 
in the plant cell under the control of a tissue specific promoter, e.g. specific for expression 
in leaves, stalks, stems, roots, petals, flowers, reproductive organs, fruits, mbers, bulbs and 
seeds; or under the control of a promoter that is specific for a certain phase of the 
development of the plant. The transcription factor(s) of the invention may also be 
expressed in the plant cell under the control of promoters that are activated by 
environmental signals, including but not limited to stress-related signals, light, 
exogenously added chemicals, pathogens or pathogenic signals. Examples of such 
promoters will be clear to the skilled person. 

Thus, according to the invention, the cell to which the AP2 transcription factor is 
provided may be in the form of - i.e. may be present in - a cell culture (e.g. in vitro); or 
may be present in a plant in vivo, including but not limited to any tissue, part or organ of 
such a plant, such as the leaves, stalks, stems, roots, petals, flowers, reproductive organs, 
as well as fruits, tubers, bulbs and/or seeds. 

In one specific embodiment, the one or more enzymatic conversion that are 
modulated by the expression of the AP2 transcription factors in accordance with the 
invention are enzymatic conversions that form part of the metabolic pathway(s) which in 
nature - e.g. in that plant (cell) as it natively occurs - are modulated or influenced by 
external stress. 

In particular, such enzymatic conversion(s) may form part of the metabolic 
pathway(s) which in nature are modulated or influenced by the (extracellular) presence of 

>5 one or more of the signalling molecules, including but not limited to jasmonic 

acid/jasmonate, ethylene, salicylic acid/salicylate, abscisic acid, gibberelic acids, nitrogen 
oxide, hydrogen peroxide, reactive oxygen species, gamma-amino-butyric-acid, 
fluctuations in intracellular calcium or other ion concentrations, phospholipids, etc.. 

More in particular, such enzymatic conversion(s) may form part of the metabolic 

JO pathway(s) which in nature are modulated or influenced by the (extracellular) presence of 
jasmonates, e.g. Again, these may be metabolic pathways leading to the formation of 
primary and/or to the formation of secondary metabolites. 
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Usually, in order to regulate one or more of the stress-related or stress factor- 
mediated enzymatic conversions/metabolic pathways, an AP2 transcription factor will be 
used that contains a single AP2 domain. 

In particular, for this purpose, an AP2 transcription factor may be used that 
contains an AP2 domain of an AP2 transcription factor that in nature is regulated by 
external stress or the (extracellular) presence of one of the signalling molecules mentioned 
above; or that contains a mutant, analog, variant, part or fragment of such an naturally 
occuring AP2 domain. Such AP2 domains are described above. 

More in particular, the AP2 transcription factor may be used that contains an AP2 
domain of a transcription factor that is regulated by the (extracellular) presence of one or 
more jasmonates; or that contains a mutant, analog, variant, part or fragment of such an 
naturally occuring AP2 domain. Again, such AP2 domains are described above. 

Thus, in another aspect, the invention relates to a method of modulating in a cell 
the expression of one or more genes involved in the biosynthesis of a metabolite or a 
1 5 precursor herefor, the method comprising inserting into the cell a nucleotide sequence 

coding for a transcription factor other than a transcription factor having a bHLH-type or a 
MYB-type DNA-binding domain, the nucleotide sequence is operably linked to at least 
one expression regulating sequence, and/or modifying the expression of a nucleotide 
sequence coding for such a transcription factor already present in the cell, the transcription 
20 factor is capable of regulating the expression of at least one gene coding for a gene product 
involved in the biosynthesis of said metabolite or precursor, and subjecting the cell to 
conditions where the inserted nucleotide sequence coding for a transcription factor is 
expressed or the expression of the nucleotide sequence already present in the cell is 
modulated. 

25 In yet another aspect, the invention relates to a method of modulating the stress 

resistance of a cell, the method comprising inserting into the cell a nucleotide sequence 
coding for a transcription factor, the nucleotide sequence is operably linked to at least one 
expression regulating sequence, and/or modifying the expression of a nucleotide sequence 
coding for such a transcription factor already present in the cell, the transcription factor is 

30 capable of regulating the expression of at least one gene coding for a gene product 

involved in the biosynthesis of said metabolite or precursor, and subjecting the cell to 
conditions where the inserted nucleotide sequence coding for a transcription factor is 
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expressed or the expression of the nucleotide sequence already present in the cell is 
modulated, the expression of said nucleotide sequence resulting in a modified 
responsiveness of the cell towards exogenous stress conditions. 

In a further aspect, the invention relates to a recombinant cell having, relative to its 
5 parent cell, enhanced or reduced biosynthesis of a metabolite or a precursor therefor, 
and/or enhanced or reduced expression of a gene product involved in metabolite 
production, the cell comprising a nucleotide sequence coding for a transcription factor 
other than a transcription factor having a bHLH-type or a MYB-type DNA-binding 
domain, said transcription factor is capable of regulating the expression of at least one 

!0 gene coding for a gene product involved in the metabolite production, said sequence is 

inserted into the cell and/or its expression is modified by operably linking it to a regulating 
sequence with which it is not natively associated. 

According to the above aspects of the invention, the cell is preferably a plant cell, 
more preferably of a dicotyledonous species. In particular, the cell may be of a species 

15 selected from the group consisting of the Gentianales order and the Cornales order and/or 
of a species selected from an indole alkaloid producing species. Some non-limiting 
preferred examples thereof include species selected from the group consisting of species 
belonging to the Apocynaceae, Alangiaceae, Loganiaceae, Icacinaceae. Cornaceae and 
Rubiaceae families, and the genus Catharanthus, which is particularly preferred. Most 

20 preferred is a Catharanthus roseus cell. 

The plant cell may be in the form of a cell culture (e.g. in vitro); or may be present 
in a plant tissue and/or in any part or organ of the plant (e.g. in vivo), including but not 
limited to the leaves, stalks, stems, roots, petals, flowers, reproductive organs, fruits, 
tubers, bulbs and seeds. 

25 The nucleotide sequence coding for the transcription factor is preferably derived 

from a plant cell, more preferably from a cell derived from a dicotyledonous species. 

In particular, the nucleotide sequence coding for the transcription factor may be 
derived from a cell of a species selected from the group consisting of the Gentianales order 
and the Cornales order and/or of a species selected from an indole alkaloid producing 

30 species. Some non-limiting preferred examples thereof include species selected from the 
group consisting of species belonging to the Apocynaceae, Alangiaceae, Loganiaceae, 
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Icacinaceae, Cornaceae and Rubiaceae families, and the genus Catharanthus, which is 
particularly preferred. Most preferred is a Catharanthus roseus cell. 

Preferably, the nucleotide sequence is a sequence coding for a transcription factor 
comprising at least one AP2 domain, including at least two AP2 domains and at least three 
5 AP2 domains. 

For instance, the nucleotide sequence may be a sequence comprising at least one 
sequence coding for an AP2 domain selected from the group consisting of AP2 domains of 
SEQ ID NO:4, SEQ ID NO:5 and SEQ ID NO:6; and/or a sequence coding for an amino 
acid sequence selected from the group consisting of SEQ ID NO:4, SEQ ID NO:5 and 
10 SEQ ID NO:6. 

Also, the nucleotide sequence may be a sequence coding for an amino acid 
sequence comprising an at least one AP2 domain having at least 40% homology, at least 
50%, at least 60%, at least 70%, at least 80% or at least 90% homology to any of the at 
least one AP2 domains of SEQ ID NO:4, SEQ ID NO:5 and SEQ ID NO:6. 
15 Alternatively, the nucleotide sequence may comprise at least one sequence coding 

for an AP2 domain of a sequence selected from the group consisting of SEQ ID NO: 1 , 
SEQ ID NO:2 and SEQ ID NO:3. The nucleotide sequence coding for the at least one AP2 
domain may also be a mutant, allele or variant of a nucleotide sequence coding for an AP2 
domain of a sequence selected from the group consisting of SEQ ID NO: I, SEQ ID NO:2, 
20 and SEQ IDNO:3. 

The nucleotide sequence coding for a transcription factor may be a homologous 
nucleotide sequence or a heterologous nucleotide sequence, e.g. to the plant (cell) to which 
it is provided. 

Preferably, the nucleotide sequence is operably linked to a regulating nucleotide 
25 sequence with which it is not natively associated, as further described below. 

The metabolite is preferably a plant metabolite. As such it may be a primary plant 
metabolite, including but not limited to a primary plant metabolite selected from the group 
consisting of tryptophan or precursors therefor; terpenoid(s) including but not limited to 
geraniol; and precursors therefor. 
30 The plant metabolite may also be a secondary plant metabolite. For instance, such a 

secondary metabolite may be selected from the group consisting of alkaloid compounds, 
phenolic compounds and terpenoid compounds, eind in particular derived from a compound 
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selected from the group consisting of tryptophan, tryptamine, tyrosine, lysine, ornithine, 
nicotinic acid, anthranilic acid and acetate, although the invention in its broadest sense is 
not limited thereto. 

The secondary metabolite may also be an indole alkaloid such as a terpenoid indole 
5 alkaloid, including but not limited to pharmacologically active terpenoid indole alkaloids 
such ajmalicine, vinblastine, vincristine, ajmaline, reserpine, rescinnamine, camptothecine, 
ellipticine, quinine, and quinidine; and/or a precursor therefor, such as strictosidine. 

The gene product involved in metabolite production may be a protein including but 
not limited to an enzyme, such as an enzyme involved in an alkaloid biosynthetic pathway. 
1 0 For instance, the enzyme may be selected from the group consisting of Anthranilate 
synthase (ASA); D-1 deoxyxylulose 5-phosphate synthase (DXS); Geraniol 10- 
hydroxylase (GIOH); NADPHxytochrome P450 reductase (CPR); Tryptophan 
decarboxylase (TDC); Strictosidine synthase (STR); Strictosidine p-D-glucosidase (SOD); 
Desacetoxyvindoline 4-hydroxylase (D4H); Acetyl-CoA:deacetylvindoline 4-0- 
1 5 acetyltransferase (DAT). 

Preferably, the expression in the cell of one or more genes involved m the 
biosynthesis of a metabolite or a precursor herefor is enhanced or reduced relative to a cell 
into which a nucleotide sequence coding for a transcription factor is not inserted or which 
is without modified expression of the already present nucleotide sequence coding for a 

20 transcription factor. 

For instance, in one embodiment, the expression in the cell of one or more genes 
involved in the biosynthesis of a metabolite or a precursor herefor may be enhanced by at 
least 10%, at least 25% or at least 100%, relative to a cell into which a nucleotide sequence 
coding for a transcription factor is not inserted or which is without modified expression of 
25 the already present nucleotide sequence coding for a transcription factor. 

In another embodiment, the expression in the cell of one or more genes involved in 
the biosynthesis of a metabolite or a precursor herefor may be reduced by at least 10%, at 
least 25%> or at least 50%, or at least 95%, relative to a cell into which a nucleotide 
sequence coding for a transcription factor is not inserted or which is without modified 
30 expression of the already present nucleotide sequence coding for a transcription factor. 

In yet another aspect, the invention relates to a method of producing a metabolite 
including a plant secondary metabolite, the method comprising providing a recombinant 
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cell as described above, cultivating said cell under conditions where the nucleotide 
sequence coding for the transcription factor regulating the expression of at least one gene 
coding for a gene product involved in the metabolite production is expressed, and 
recovering the metabolite. 
5 In said method, the ceil is preferably cultivated under conditions where at least one 

precursor for the metabolite is added in a form which can be assimilated by said cell. 

However, this method also encludes cultivating and/or maintaining a plant that 
contains at least one recombinant cell as described above, and then recovering the 
metabolite from the plant, e.g. by harvesting the plant or at least a part, tissue or organ 
10 thereof and then recovering the metabolite from the plant material thus obtained. 

The invention also relates to a method of constructing a recombinant cell as 
described hereinabove, said method comprising the steps of 

(i) identifying in a source cell a nucleotide sequence coding for a transcription factor 
other than a transcription factor having a bHLH-type or a MYB-type DNA-binding 

15 domain, said transcription factor is capable of regulating in the source cell the 

expression of at least one gene coding for a gene product involved in biosynthesis 
of a metabolite or a precursor therefor, 

(ii) isolating said nucleotide sequence; and 

(iii) inserting said isolated nucleotide sequence into a host cell comprising a gene 

20 coding for a gene product involved in biosynthesis of a metabolite or a precursor 

therefor, the insertion of said nucleotide sequence resulting in that the expression of 
the gene of the host cell is modulated relative to the expression level in a host cell 
not comprising the inserted nucleotide sequence. 

In one embodiment of this method, at least two isolated nucleotide sequences are 
25 inserted into the host cell. 

Another aspect of the invention relates to a polypeptide which includes an amino 
acid sequence having at least 40% homology including at least 50%, such as at least 60%, 
at least 70%, at least 80% and at least 90% to an amino acid sequence selected from the 
group consisting of SEQ ID NO:4, SEQ ID NO:5 and SEQ ID NO:6; and/or to a 
30 polypeptide having the amino acid sequence shown in SEQ ID NO:4, SEQ ID NO:5 or 
SEQ ID NO:6. 
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The invention also relates to an isolated nucleic acid molecule comprising a 
nucleotide sequence coding for an amino acid sequence selected from the group consisting 
of SEQ ID NO:4, SEQ ID NO:5 and SEQ ID NO:6; and/or to an isolated nucleic acid 
comprising a nucleotide sequence which is selected from the group consisting of SEQ ID 
5 NO: 1, SEQ ID NO:2 and SEQ ID NO:3 and a mutant, allele or variant hereof. 

The invention also relates to a vector comprising a such nucleic acid molecule. 
Such a vector may further comprise at least one further sequence coding for a transcription 
factor. 

Also, the invention relates to a host cell comprising a such a vector. Such a host 
10 cell may for instance be selected from the group consisting of a bacterium, a fungal cell, a 
yeast cell, an animal cell, a unicellular eukaryotic cell, an algae cell and a plant cell. Such a 
host cell may also be a cell suitable for transformation of another cell, such as an 
Agrobacterium cell. 

The invention also relates to a plant comprising such a cell, e.g. in at least one part, 
1 5 organ or tissue. 

In yet another aspect, the invention relates to a method of isolating a nucleotide 
sequence coding for a transcription factor not having a bHLH-type or a MYB-type DNA- 
binding domain, said transcription factor is capable of regulating in a cell the expression of 
at least one gene coding for a gene product involved in biosynthesis of a metabolite or a 
20 precursor therefor, the method comprising the steps of 

(i) probing a cDNA or genomic library with a probe comprising at least a fragment of 
a nucleic acid as described above; 

(ii) identifying a DNA clone that hybridises with said at least a fragment of a nucleic 
acid; and 

25 (iii) isolating the DNA clone identified in step (ii) wherein the nucleic acid sequence is 
coding for all or a part of said transcription factor. 

Furthermore, the invention relates to a method of isolating a nucleotide sequence 
coding for a transcription factor not having a bHLH-type or a MYB-type DNA-binding 
domain, said transcription factor is capable of regulating in a cell the expression of at least 
30 one gene coding for a gene product involved in biosynthesis of a metabolite or a precursor 
therefor, the method comprising the steps of 
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(i) synthesising an oligonucleotide primer set corresponding to at least a fragment of a 
sequence selected from the group consisting of SEQ ID NO:l, 2 and 3; and 

(ii) amplifying cDNA or genomic DNA using said primer set in a polymerase chain 
reaction; 

5 wherein the amplified nucleic acid fi-agment is coding for all or a part of said transcription 
factor to obtain a nucleotide sequence coding for a transcription factor not having a bHLH- 
type or a MYB-type DNA-binding domain. 

According to one specific embodiment, the invention relates to an isolated nucleic 
acid which is a nucleotide sequence as shown in SEQ ID NO: 7 and a mutant, allele or 

10 variant hereof; and to the use of such an isolated nucleic acid sequence according to of 
SEQ ID NO: 7 or a mutant, allele or variant thereof, for isolating a nucleotide sequence 
coding for a transcription factor not having a bHLH-type or a MYB-type DNA-binding 
domain, said transcription factor is capable of regulating in a cell the expression of at least 
one gene coding for a gene product involved in biosynthesis of a metabolite or a precursor 

15 therefor, the method comprising the steps as essentially described in Example 2 
hereinbelow. 

Also, in yet another aspect, the invention relates to a plant, plant cell or plant 
material that has been transformed with a genetic construct comprising at least one 
nucleotide sequence encoding at least one transcription factor, operably linked to an 

20 expression regulating sequence, in which the plant (cell), genetic construct, transcription 
factor and expression regulating sequence may be as defined above. 

Preferably, the expression regulating sequence is heterologous to the plant (cell) 
and/or is an expression regulating sequence with which the at least one nucleotide 
sequence that encodes said at least one transcription factor is not natively associated. 

25 Preferably, in such a transformed plant cell or transformed plemt (e.g. in at least one 

part, organ, tissue or cell thereof), 

- the intracellular level(s) of one or more metabolites are enhanced, the intracellular 
levels of one or more (other) metabolites are reduced; or a combination thereof, e.g. as 
described above and/or by the amoimts indicated above; and/or 
30 - the expression of one or more genes involved in the biosynthesis of a metabolite or a 
precursor herefor are enhanced; the expression in the cell of one or more genes 
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involved in the biosynthesis of a metabolite or a precursor herefor ar reduced; or a 
combination thereof, e.g. as decribed above and/or by the amounts indicated above. 
The invention also relates to cultivating material for and/or of such a transformed 
plant, including but not limited to seed, tubers, fruits, roots, bulbs and/or seedlings. 
5 The invention also relates to descendants of such a transformed plant, which may 

for instance be obtained by sexual or asexual reproduction, including but not limited to 
standard cultivation techniques, starting from a plant transformed according to the method 
of the invention. Preferably, in (at least one cell, part, tissue or organ of) such a 
descendant, 

!0 - the intracellular level(s) of one or more metabolites are enhanced, the intracellular 

levels of one or more (other) metabolites are reduced; or a combination thereof, e.g. as 
described above and/or by the amounts indicated above; and/or 

- the expression of one or more genes involved in the biosynthesis of a metabolite or a 
precursor herefor are enhanced; the expression in the cell of one or more genes 

15 involved in the biosynthesis of a metabolite or a precursor herefor ar reduced; or a 

combination thereof, e.g. as decribed above and/or by the amounts indicated above; 
compared to the corresponding native (e.g. non-transformed) plant. 

Also, the invention relates to a method for providing a transformed plant or a 
descendant of such a transformed plant, in which, compared to the corresponding native 
20 (e.g. non-transformed) plant, 

- the intracellular level(s) of one or more metabolites are enhanced, the intracellular 
levels of one or more (other) metabolites are reduced; or a combination thereof, e.g. as 
described above and/or by the amounts indicated above; and/or 

- the expression of one or more genes involved in the biosynthesis of a metabolite or a 
25 precursor herefor are enhanced; the expression in the cell of one or more genes 

involved in the biosynthesis of a metabolite or a precursor herefor ar reduced; or a 
combination thereof, e.g. as decribed above and/or by the amounts indicated above; 
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said method comprising the steps of: 

(a) transforming a plant, plant cell or plant material with a genetic construct 

comprising at least one nucleotide sequence encoding at least one transcription 
factor, operably linked to an expression regulating sequence 
5 (b) optionally cultivating the plant, plant cell or plant material into a mature plant; 

and/or 

(c) optionally providing one or more further generation of the transformed plant of 
step (a) and/or step (b) by sexual or asexual reproduction, including but not 
limited to standard plant cultivation and/or plant breeding techniques; 
10 in which the plant (cell), genetic construct, transcription factor and expression regulating 
sequence may be as defined above 

In yet another aspect, the invention relates to plant material of and/or obtained from 
a plant as described above. 

Also, the invention relates to a metabolite, in particular a primary or secondary 
15 plant metabolite (e.g. as described above), obtained via the method described above and/or 
from a plant, plant cell and/or plant material as described above; or from a descendant of 
such a plant. 

Other aspect, embodiments, advantages and possible applications of the invention 
will become clear from the description hereinbelow. 

20 

DETAILED DISCLOSURE OF THE INVENTION 

A major objective of the present invention is to provide a method of modulating in 
a cell the expression of one or more genes involved in the biosynthesis of a metabolite or a 

25 precursor for the metabolite. As mentioned above, the method comprises that a nucleotide 
sequence coding for a transcription factor other than a transcription factor having a bHLH- 
type or a MYB-type DNA-binding domain is inserted into the cell such that the nucleotide 
sequence is operably linked to at least one expression regulating sequence, and/or that the 
expression of a nucleotide sequence coding for such a transcription factor already present 

30 in the cell is modified. 

In the present context, the term "modulating'' with respect to (the expression of) a 
gene refers to any change in the level of expression of a gene involved in the biosynthesis 
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of a metabolite or a precursor for the metabolite relative to the level of expression in a 
corresponding cell type not having a modulated expression of the same gene or genes, i.e. 
in the wild type state. It will be appreciated that "modulating" may imply both reduced and 
enhanced levels of expression relative to that found in the wild type cell, e.g. as outlined 
5 above. 

With respect to metabolites - including but not limited to primary metabolites, 
secondary metabolites and/or precursors therefor, as well as intermediates of metabolic 
routes leading to such metabolites - ''modulating'' refers to any change in the level of said 
metabolite or a precursor for the metabolite relative to the level thereof in a corresponding 
0 cell in the wild type state. It will be appreciated that "modulating" may imply both reduced 
and enhanced levels of said metabolites relative to those found in the wild type cell, e.g. as 
outlined above. 

As used herein the term "ceir encompasses any eukaryotic or prokaryotic cell. 
Thus, cells include bacterial cells, fungal cells including yeast cells, animal cells, 
dictyostelium cells, algae cells, and plant cells. The cell may be in the form of a single, 
isolated cell or it may in the form of multicellular tissues including an animal or a plant 
organ, plant callus tissues and an entire animal or plant. 

The expression "nucleotide sequence coding for a transcription factor'' is used 
herein encompasses any nucleotide sequence coding for a transcription factor which does 
not comprise a bHLH-type or a MYB-type DNA-binding domain. It will be appreciated 
that such a coding sequence may consist solely of coding DNA (ORFs) or, when it is 
derived from a eukaryotic cell, it may include introns. 

In this context, the expression "transcription factor" refers to a protein that in a 
sequence-specific manner binds to DNA by recognising specific elements located in the 
promoter and/or enhancer regions of the corresponding genes. The binding of such 
transcription factors to the specific DNA elements modulates the activity of other 
components of the transcription machinery including basal transcription factors and RNA 
polymerase, and thereby positively or negatively modulates the initiation of mRNA 
synthesis. Transcription factors have in common that they comprise DNA binding domains 
capable of binding to specific elements of their target genes. 

Transcription factors are classified in families based on conserved features of their 
DNA-binding domain. Within each family, DNA-binding domains are similar, but not 
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identical. Small differences in amino acid sequences can cause significant differences in 
the DNA sequence requirement for binding. As a result of these differences within a 
family, it is impossible to reliably predict whether a certain DNA sequence will bind a 
certain transcription factor family member with high affinity, and conversely, based on the 
5 amino acid sequence of a certain transcription factor family member it is impossible to 
reliably predict which sequence it will bind with high affinity. 

Transcription factors have a modular structure, consisting of domains with 
specialised functions, such as sequence-specific DNA binding, transcriptional activation, 
interaction with themselves or other proteins. These domains are in many cases functional 
10 even when fused to heterologous proteins. ORCA proteins contain the AP2-type DNA- 
binding domain. Since they activate transcription, it can be concluded that they contain an 
activation domain. It is conceivable that in e.g. Catharanthus cells the activity of AP2 
domain class transcription factors is modulated via post-translational modifications 
(phosphorylation, myristoylation etc.) via interaction with other proteins, or combinations 
15 thereof. The activity of AP2 domain class transcription factors can be modulated using 
recombinant DNA techniques. The objectives could be to make the activity of the 
transcription factors independent from post-translational modifications, to enhance their 
activity, or to make the activity dependent on convenient inducing conditions. These 
objectives can be accomplished via single amino acid modifications, deletions of several 
20 amino acids, fusions with other proteins or protein domains, or combinations thereof AP2 
domain-containing transcription factors may be constitutively targeted to the nucleus by 
addition of a nuclear localisation signal. Strong activation domains could be fused to the 
transcription factors in addition to or as a replacement for existing activation domains. 
Such activation domains include the Herpes simplex virus VP 16 domain, the GAL4 
25 activation domain, or synthetic activation domains known to be active in plants, 

Inducibility of transcription factor activity can be achieved via fusion with mammalian 
steroid hormone receptor domains. The resulting chimera can be activated by steroid 
hormone addition. 

In accordance with the above method of the invention, the nucleotide sequence 
30 coding for the transcription factor is operably linked to at least one expression regulating 
sequence. In the present context, the term ''operably linked' refers ih general to a situation 
where a first nucleic acid sequence is linked with a second nucleic acid sequence and the 
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first nucleic acid sequence is placed in a functional relationship with the second nucleic 
acid sequence. For instance, a promoter is operably linked to a coding sequence if the 
promoter affects the transcription or expression of the coding sequence (i.e. that the coding 
sequence is under the transcriptional control of the promotor). Generally, operably linked 
5 nucleic acid sequences are contiguous and, where necessary to join two protein coding 

regions, in reading frame. However, operably linked nucleic acid sequences may also be in 
a functional relationship without being contiguous. Typical examples of ''expression 
regulating sequences'' include promoter sequences, enhancer sequences, translation leader 
sequences, sequences coding for anti sense RNA and 3' non-coding sequences. 
10 It will be understood that the term "promoter" includes both constitutive promoters 

and regulatable promoters. In respect of the latter type of promoters, the regulation is 
typically conferred by exogenous (extrinsic) or endogenous (intrinsic) factors including 
physical factors such as the temperature or light, chemical factors including the pH, the 
presence of inducer substances, absence/presence of nutrient compounds in the cell 
15 envirorunent. Most preferably, a promoter should be "operaA/e" in the cell to which it is 
provided, by which is meant that the promoter should be capable of causing and/or 
regulating in said cell the expression of one or more nucleotide sequences (e.g. genes) that 
are operably linked (as defined above) to said promoter. Such nucleotide sequences should 
be understood as being "under the control of said promoter. 
20 In one presently preferred embodiment, the cell in which the expression of one or 

more genes involved in the biosynthesis of a metabolite or a precursor herefor is a plant 
cell. 

The term ''plant cell" as used herein include protoplasts, fused cells, gamete 
producing cells, undifferentiated and differentiated cells, and cells which regenerate into 

25 whole plants. Accordingly, a seed comprising multiple plant cells capable of regenerating 
into a whole plant, is included in the definition of "plant cell". In useful embodiments, the 
plant cell is in the form of a cell culture. 

As used herein, the term "plant" refers to any organism of the kingdom Plantae, 
including Bryophytes, Pteridophytes, Spermatophytes {Gymnosperms and Angiosperms, 

30 including monocotyledonous and dicotyledonous), thallophytes (including algae such as 
Chrysophyta, Pyrrophyta, Euglenophyta, Rhodophyta, Phaeophyta and Clorophyta), a 
whole plant, any plant part or propagule thereof, seed or hybrid progeny and descendants. 
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a plant cell, a multiplicity of plant cells such as e.g. plant tissues, and plant cells in the 
form of a cell culture. Plantlets are also included within the meaning of "planf\ 
In specific embodiments, the method relates to plant cells of a species of the Gentianales 
order or the Cornales order such as a species selected from an indole alkaloid producing 
5 species. Such species include species of the Apocynaceae, Alangiaceae, Loganiaceae, 
Icacinaceae, Cornaceae and Rubiaceae families. One particularly interesting genus of the 
Apocynaceae family is the genus Catharanthus including the species C roseus, C 
coriaceus, C. lanceus, C. longifolius, C. ovalis, C. pusillus, C scitulus and C 
trichophyllus. 

10 In accordance with the invention, the nucleotide sequence coding for a transcription 

factor as defined above can be derived from any cell type comprising such a sequence. 
Particularly useful sources for such sequences are plant cells of any of the above types and 
forms- 

In one interesting embodiment, the nucleotide sequence coding for a transcription 
15 factor is a sequence coding for a transcription factor comprising at least one AP2 domain. 
Transcription factors comprising such a DNA-binding domain are generally referred to as 
AP2/EREBP transcription factors which constitute a class of transcription factors ±at at 
present have only been found in plants. They posses two features that are typical for 
transcription factors, namely sequence-specific DNA binding capability and ability to 
20 activate (or repress) transcription. (Riechmarm and Meyerowitz, 1998). 

Sequence similarity among the AP2/EREBP proteins is mostly limited to the AP2 
DNA-binding domain (APETALA2 domain) itself (approximately 68 aa), which is the 
only region conserved among all proteins of the family. The distinguishing characteristic 
of proteins of the AP2/EREBP family is that they contain either one or two AP2 domains. 
25 Thus this family can be divided into two subfamilies based on whether the proteins contain 
one or two AP2 domains, the EREBP subfamily and the AP2 subfamily, respectively. The 
AP2 domains in the AP2 subfamily are more related among members of this subfamily 
than to the AP2 domains of proteins of the EREBP subfamily and vice versa. However, 
many characteristics are common to the AP2 domains from both subfamilies. Two 
30 conserved segments are found within each AP2 domain, which have been referred to as the 
YRG element, and the RAYD element. The amino terminal part of the AP2 domains (the 
YRG element) is basic and hydrophilic. The carboxy terminal RAYD element contains a 
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central region that, in almost all AP2 domains, is predicted to adopt the configuration of an 
alpha-helix of amphipathic character. Solution structure determination of one particular 
Arabidopsis AP2 domain protein revealed that the YRG element and the next 20 to 30 
amino acids form three anti-parallel P-sheets that contact the DNA, whereas the RAYD 
element indeed adopts an a-helical structure as predicted. 

As used herein an AP2 DNA-binding domain is defined as an amino acid sequence 
having at least 35% amino acid residue identity such as at least 45%, at least 55%, at least 
65%, at least 75%, at least 85%, at least 95% or at least 98% residue identity to any of the 
AP2 domains described in Riechmann and Meyerowitz (1998). However, in the present 
context, any amino acid sequence having similarity to any of the two conserved segments, 
the YRG element and the RAYD element as defined by Riechmann and Meyerowitz 
(1998) are encompassed by the invention and considered as having an AP2 DNA-binding 
domain. 

In another embodiment, the transcription factor as used herein includes at least two 
AP2 domains such as at least three AP2 domains. 

In specific embodiments, the method is based on the use of a transcription factor 
coding sequence comprising at least one sequence coding for an AP2 domain selected 
from AP2 domains of SEQ ID NO:4, SEQ ID NO:5 or SEQ ID NO:5 or a sequence coding 
for an amino acid sequence selected from SEQ ID NO:4, SEQ ID NO:5 or SEQ ID NO:6. 

It will be appreciated that in accordance with the invention, a functional AP2 
domain class transcription factor can be derived from a source cell naturally containing 
such a factor. However, it is contemplated that a transcription factor that is functional in 
accordance with the invention can be constructed either by isolating one or more AP2 
domains from a naturally occurring transcription factor and combining it with an amino 
acid sequence with which it is not naturally associated. Alternatively, an AP2 domain may 
be constructed synthetically and combined with further amino acids to provide a 
transcription factor active polypeptide. 

In accordance with the method of the invention, the nucleotide sequence may be a 
sequence coding for an amino acid sequence comprising an at least one AP2 domain 
having at least 40% homology, at least 50%>, at least 60%, at least 70%, at least 80% or at 
least 90% homology to any of the at least one AP2 domains of SEQ ID NO:4, SEQ ID 
NO:5 and SEQ IDNO:6. 
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As used herein, the term "homology" refers to homology at the amino acid level 
both in terms of amino acid identity and similarity. Similarity allows for ''conservative 
variation", i.e. substitution of one hydrophobic residue such as isoleucine, valine, leucine 
or methionine with another hydrophobic residue, or the substitution of one polar residue 
5 with another polar residue, such as arginine with lysine, glutamic acid with aspartic acid, 
or glutamine with asparagine. In the present context, similarity is defined and determined 
using the present version of the BLAST program (Altschul et al, 1997). 

In other specific embodiments, the nucleotide sequence coding for the transcription 
factor comprises at least one AP2 domain-coding sequence selected from SEQ ID NO:l, 
0 SEQ ID NO:2 and SEQ ID NO:3 or the nucleotide sequence coding for the at least one 
AP2 domain is a mutant, allele, derivative or variant of a nucleotide sequence coding for 
an AP2 domain of a sequence selected SEQ ID NO:l, SEQ ID NO:2 and, SEQ ID NO:3. 

In the present context, the terms "mutant", "allele", "derivative" or "variant" may 
be by one or more of addition, insertion, deletion or substitution of one or more 
nucleotides in the nucleotide sequence, leading to the addition, insertion, deletion or 
substitution of one or more amino acids in the encoded polypeptide while substantially 
maintaining biological activity of the encoded polypeptide. Also included are changes in 
the nucleotide sequences which make no difference to the encoded amino acid sequence. 

Variant, allelic, mutant or derivatised nucleotide sequences can e.g. be produced by 
standard DNA mutagenesis techniques or by chemically synthesising the variant DNA 
molecule. Such variants, alleles, mutants or derivatives do not change the reading frame of 
the protein-coding region of the nucleotide sequence and encode a protein having no or 
only minor changes in its biological function. 

In accordance with the method of the invention, the nucleotide sequence coding for 
a transcription factor, the expression of which is modulated, is either a homologous 
nucleotide sequence or a heterologous nucleotide sequence. As used herein, the term 
"homologous" refers generally to a nucleotide sequence present or originating in the same 
species and the term "heterologous" refers correspondingly to a sequence originating from 
a different species. 

It will be appreciated that when the sequence is a homologous sequence, the 
expression of the sequence may be modulated while it is in its natural location in the cell, 
but it is also contemplated that a homologous coding nucleotide sequence may be 
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translocated within the cell of origin. Whether homologous or heterologous, the 
transcription factor-encoding nucleotide sequence may be operabiy linked to a regulating 
nucleotide sequence with which it is not natively associated. 

In one step of the above method, a nucleotide sequence coding for a transcription 
5 factor is inserted into the cell. The insertion can be carried out using any conventional 
methods of inserting nucleic acid into cells. Such transformation methods include as 
examples microprojectile bombardment, microinjection, electroporation, liposome 
mediated uptake and transformation by means of a disarmed Ti-plasmid vector carried by 
Agrobacierium. Numerous plant transformation vectors are available for plant 
10 transformation, and genes encoding transcription factors according to the invention can be 
used in conjunction with any such vectors. The selection of vectors for use will depend 
upon the preferred transformation technique and the target species for transformation. For 
certain target species, different antibiotic or herbicide selection markers may be preferred. 
Selection markers used routinely in transformation include the nptll gene which confers 
1 5 resistance to kanamycin and related antibiotics the bar gene which confers resistance to the 
herbicide phosphinothricin, the hph gene which confers resistance to the antibiotic 
hygromycin, the dhfr gene which confers resistance to methotrexate and the Tdc gene, 
which confers resistance to 4-methyltryptophan. 

Also many vectors are available for plant transformation using Agrobacterium 
20 tumefaciens. These typically carry at least one T-DNA border sequence and include 

vectors such as pBINlP, pCAMBIA, pMOG402 and pMOG22. Typical vectors contain 
DNA borders for Agrobacterium-m^dxdiiQd transformation, the RX2 -derived tra function 
for mobilisation between E. coli and other hosts, and the OriT and OriV functions that are 
also from RK2. They usually contain a polylinker that is suitable for the cloning of plant 
25 expression cassettes containing their own regulatory signals. 

Transformation without the use of Agrobacterium tumefaciens can also be 
performed which circumvents the requirement for T-DNA sequences in the chosen 
transformation vector, and consequently vectors lacking these sequences can be utilised in 
addition. The choice of vector depends largely on the preferred selection for the species 
30 being transformed. 

In accordance with the invention, gene sequences intended for expression in 
transgenic plants can be assembled in expression cassettes behind a suitable promoter and 
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upstream of a suitable transcription terminator. These plant expression cassettes can then 
by use of standard methods be transferred to the plant transformation vectors described 
above. 

The selection of a promoter used in expression cassettes will determine the spatial 
and temporal expression pattern of the transgene in the transgenic plant. Selected 
promoters will express transgenes in specific cell types (such as leaf epidermal cells, 
mesophyll cells, root cortex cells) or in specific tissues or organs (roots, leaves or flowers, 
for example) and this selection will reflect the desired location of expression of the 
transgene. Alternatively, the selected promoter may drive expression of the gene under a 
light-induced or other temporally regulated promoter. A further altemative is that the 
selected promoter be chemically and/or physically regulated. This provides the possibility 
of inducing expression of the transgene only when desired and caused by treatment with a 
chemical and/or physical inducer. 

A variety of transcriptional terminators are available for use in expression cassettes. 
They are responsible for the termination of transcription beyond the transgene and its 
correct polyadenylation. Appropriate transcriptional terminators are those which are 
known to function in plants and include the CAMV 35S terminator, the tml terminator, the 
nopaline synthase terminator, and the pea rbcs E9 and 3C terminators. They can be used in 
both monocotyledons and dicotyledons. 

It is a further objective of the present invention to provide the means of enhancing 
biosynthesis of metabolites in a cell as defined above. Although it is contemplated that the 
discoveries on which the present invention is based is applicable to any of these cell types 
and to any metabolite synthesised in such cells by genes the expression of which are 
regulated by transcription factors as defined herein, the method of the invention is 
presently preferred for the optimisation of biosynthesis of plant metabolites, in particular 
secondary metabolites including those mentioned above. However, It is also contemplated 
that the method of the invention can be used for the biosynthesis of novel and presently 
unknown metabolites, including novel plant secondary metabolites. 

Thus, the secondary metabolites the biosynthesis of which can be enhanced using 
the method of the invention include but are not limited to alkaloid compounds including 
indole alkaloids such as terpenoid indole alkaloids (TIAs), phenolic compounds and 
terpenoid compounds. A review of secondary plant metabolites is included in e.g. 
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Phytochemical Dictionary, A Handbook of Bioactive Compounds from Plants, edited by 
Harbome and Baxter, Taylor & Fr£incis, London and Washington, DC. In useful 
embodiments, the secondary metabolites are alkaloids derived from compounds such as 
tryptophan (indole alkaloids including vincristine and vinblastine), tyrosine (isoquinoline 
5 alkaloids including berberine, berbamine and stephamine), lysine (quinolizidine alkaloids 
including lupamine and sparteine), ornithine (tropane alkaloids including scopolamine and 
atropine), nicotinic acid, anthranilic acid and acetate. 

Primary metabolites the biosynthesis of which can be enhanced using the method 
of the invention include but are not limited to tryptophan and precursors thereof as well as 
10 terpenoids and precursors thereof 

In accordance with the invention the transcription factor as used in the above 
method is capable of regulating the expression of at least one gene coding for a gene 
product involved in metabolite biosynthesis. Typically, such a gene product is a protein 
such as an enzyme. In a useful embodiments the enzyme is involved in an alkaloid 
15 biosynthetic and precursor pathways including an enzyme that is selected from the group 
consisting of Anthranilate synthase (ASA); D-1 deoxyxylulose 5 -phosphate synthase 
(DXS); Geraniol 10-hydroxylase (GIOH); NADPHxytochrome P450 reductase (CPR); 
Tryptophan decarboxylase (TDC); Strictosidine synthase (STR); Strictosidine p-D- 
glucosidase (SGD); Desacetoxyvindoline 4-hydroxylase (D4H); Acetyl- 
20 CoA:deacetylvindoline 4-O-acetyltransferase (DAT). 

As mentioned above, the present invention has made it possible to modulate in a 
cell the expression of one or more genes involved in the biosynthesis of a metabolite or a 
precursor for the metabolite. Although in presently preferred embodiments the expression 
of the gene or genes is enhanced with the objective of improving the yield of a desired 
25 metabolite, it is also contemplated that it may, under certain conditions, be advantageous to 
be able to reduce the expression of one or more biosynthetic genes. The enhancement, or 
alternatively, the reduction of the gene expression, relative to a cell into which a nucleotide 
sequence coding for a transcription factor is not inserted or which is without modified 
expression of the already present nucleotide sequence coding for a transcription factor, is 
30 obtained by modifying the expression level of the nucleotide sequence coding for the 
transcription factor as defined herein. 
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In a specific embodiment, the expression in the cell of one or more genes involved 
in the biosynthesis of a metabolite or a precursor herefor is enhanced by at least 10%, at 
least 25%, at least 50%, at least 75% or at least 100%, relative to a cell into which a 
nucleotide sequence coding for a transcription factor is not inserted or which is without 
modified expression of the already present nucleotide sequence coding for a transcription 
factor. It will be understood that "enhancement of gene expression" is related to the 
increase in biosynthesis of the metabolite. It is also contemplated that the enhancement of 
gene expression can be higher such as by a factor of at least 1 .5 including at least 2, at least 
5 or at least 10. 

The enhancement of the expression of one or more genes involved in the 
biosynthesis of a metabolite or a precursor herefor can, in accordance with the invention, 
be achieved in either of two ways. One way is by inserting into the cell a nucleotide 
sequence coding for a transcription factor as defined herein and operably linking it to at 
least one expression regulating sequence, the cell comprising at least one target 
biosynthetic gene for the inserted transcription factor. An alternative way is by modifying 
the expression of a nucleotide sequence coding for such a transcription factor already 
present in the cell. In either of such ways, the modification results in that the transcription 
factor is produced in higher amounts than in a cell of a corresponding type that is not 
modified by insertion of a transcription factor-encoding sequence or by modifying the 
expression of such a sequence already present in the cell. It is evident that the expression 
of the biosynthetic gene is enhanced in a cell type not previously containing the 
transcription factor provided the cell contains a biosynthetic gene comprising a target 
sequence for the transcription factor. However, the expression level of an inserted 
transcription factor-encoding nucleotide sequence can also be enhanced by in vivo or in 
vitro modifications. 

In accordance with the invention, enhancement of the expression of a transcription 
factor encoding sequence ("upregulation") can be obtained by any known technique 
leading to enhanced gene expression. As an example, the coding sequence can be linked to 
a stronger promoter or the native promoter of the sequence can be structurally modified so 
as to improve its promoter strength. The expression may also be enhanced by modifying 
any other regulating sequence operably linked to the coding sequence e.g. by mutation. An 
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alternative approach is to modify the nucleotide sequence of the transcription factor- 
encoding sequence. 

A number of different sequences have been found to enhance gene expression from 
within the transcriptional unit and these sequences can thus be used in conjunction with the 
5 transcription factor-encoding nucleotide sequences according to the invention to increase 
their expression in transgenic plants. Various intron sequences have been shown to 
enhance expression, particularly in monocotyledonous cells. Intron sequences have been 
routinely incorporated into plant transformation vectors, typically within the non-translated 
leader. 

10 A number of non-translated leader sequences derived from viruses are also known 

to enhance expression, and these are particularly effective in dicotyledonous cells. 
Specifically, leader sequences from Tobacco Mosaic Virus, Maize Chlorotic Mottle Virus 
and Alfalfa Mosaic Virus have been shown to be effective in enhancing expression. 
Where the expression of the at least one gene involved in the metabolite 
15 biosynthesis is reduced ("down-regulated"), the reduction is typically by at least 10%, at 
least 25% or at least 50%, or at least 95%, relative to a cell into which a nucleotide 
sequence coding for a transcription factor is not inserted or which is without modified 
expression of the already present nucleotide sequence coding for a transcription factor. It is 
also contemplated that the reduction of gene expression can be higher such as by a factor 
20 of at least 1 .5 including at least 2, at least 5 or at least 1 0. 

Down-regulation of the expression of the transcription factor-encoding sequence 
can be obtained in several ways. Thus, as an example the expression is reduced by using 
anti-sense technology or "sense regulation" ("co-suppression"). In using anti-sense genes 
or partial gene sequences to down-regulate gene expression, a nucleotide sequence is 
25 placed under the control of a promoter in "reverse orientation" such that transcription 

yields RNA which is complementary to normal mRNA transcribed from the sense strand 
of the target gene. An alternative approach is to use a copy of all or part of the target gene 
inserted in sense, that is the same, orientation as the target gene, to achieve reduction in 
expression of the target gene by co-suppression. 
30 In another aspect of the invention there is provided a method of modulating the 

stress resistance of a cell, the method comprising inserting into the cell a nucleotide 
sequence coding for a transcription factor, the nucleotide sequence is operably linked to at 
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least one expression regulating sequence, and/or modifying the expression of a nucleotide 
sequence coding for such a transcription factor already present in the cell, the transcription 
factor is capable of regulating the expression of at least one gene coding for a gene product 
involved in the biosynthesis of said metabolite or precursor, and subjecting the cell to 
5 conditions where the inserted nucleotide sequence coding for a transcription factor is 
expressed or the expression of the nucleotide sequence already present in the cell is 
modulated, the expression of said nucleotide sequence resulting in a modified 
responsiveness of the cell towards exogenous stress conditions. 

In the present context, the expression "exogenous stress conditions'' should be 
10 understood as meaning any condition that results in that the grovs^h and function of the cell 
is impaired relative to optimum conditions for growth and function. In relation to a plant, 
such stress conditions include attack of the plant by a pathogen, abiotic adverse 
environmental conditions like cold or heat and drought (osmotic stress), insufficient or 
excess supply of nutrient compounds, insufficient or excess levels of plant hormones, U V 
15 light, low light levels and suboptimal pH conditions. 

In accordance with the invention the cell having modulated stress resistance may be 
any of the cell types as mentioned above in which at least one gene involved in the 
biosynthesis of a metabolite or a precursor for the metabolite is modulated as described 
above and to which there is referred. In particularly useful embodiments the cells having 
20 modulated stress resistance are plant cells of the above types and species including whole 
transgenic plants. 

In accordance with the present method of modulating stress resistance, the 
expression of at least one biosynthetic gene is modulated by modifying the expression of 
any type of transcription factors that has a target sequence in the cell. Thus, in one 

25 embodiment, the nucleotide sequence coding for a transcription factor is a sequence coding 
for a transcription factor comprising at least one AP2 domain, including at least two AP2 
domains or at least three AP2 domains of any of the above types and having any of the 
above sequences including such sequences that are operably linked to a regulating 
nucleotide sequence with which it is not natively associated. 

30 In accordance with the invention the plant cell which is modified to have enhanced 

stress resistance may be modulated in its expression of any of the above metabolites or 
gene products involved in metabolite biosynthesis. 
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Similar to what is described above in the context of the method of modulating in a 
cell the expression of one or more genes involved in the biosynthesis of a metabolite or 
precursor herefor, the present method includes that the expression in the cell of one or 
more genes involved in the biosynthesis of a metabolite or a precursor herefor is enhanced 
5 or reduced relative to a cell into which a nucleotide sequence coding for a transcription 
factor is not inserted or which is without modified expression of the already present 
nucleotide sequence coding for a transcription factor. Such an enhancement or reduction is 
achieved mutatis mutandis as it is described for the former method. 

In a further aspect, the invention provides a recombinant cell having, relative to its 

10 parent cell, enhanced or reduced biosynthesis of a metabolite or a precursor therefor, 
and/or enhanced or reduced expression of a gene product involved in metabolite 
production, the ceil comprising a nucleotide sequence coding for a transcription factor 
other than a transcription factor having a bHLH-type or a MYB-type DNA-binding 
domain, said transcription factor is capable of regulating the expression of at least one 

15 gene coding for a gene product involved in the metabolite production, said sequence is 

inserted into the cell and/or its expression is modified by operably linking it to a regulating 
sequence with which it is not natively associated. In useful embodiments such a cell is a 
plant cell of any of the types and species as mentioned above and modified as also 
described above. In other useful embodiments such a cell is a cell wherein at least one 

20 pathway leading to the biosynthesis of at least one precursor for the metabolite can be 
stimulated in the cell in the presence of an inducing agent. Such inducing agents include 
chemical compounds (e.g. an elicitor, jasmonate and a hormone) and physical factors (e.g. 
light, temperature, pH and shear stress). 

In another aspect, the invention provides a method of producing a metabolite 

25 including a plant secondary metabolite, the method comprising providing a recombinant 
cell as described above, cultivating said cell under conditions where the nucleotide 
sequence coding for the transcription factor regulating the expression of at least one gene 
coding for a gene product involved in the metabolite production is expressed, and 
recovering the metabolite. The cultivation of the recombinant cell of the invention is 

30 carried out using any conventional cultivation media and any conventional culturing 

methods for the particular type of cell. Such conventional culturing methods also includes 
methods where the cell is cultivated under conditions where at least one precursor for the 
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metabolite is added in a form which can be assimilated by the cell. In useful embodiments 
such precursors may include compounds such as e.g. tryptophan, tyrosine, lysine, 
ornithine, nicotinic acid, anthranilic acid and acetate. Also included are culturing methods 
where the recombinant cell of the invention is further genetically modified in order to 
5 stimulate the production of a precursor for the metabolite, and methods wherein a 

compound which stimulates the production of a precursor for the metabolite is added to the 
cell in form which can be assimilated by the cell. 

In a particular aspect of the invention there is provided a method of constructing a 
recombinant cell as described above, the method comprising the steps of 
10 (i) identifying in a source cell at least one nucleotide sequence coding for a 

transcription factor other than a transcription factor having a bHLH-type or a 
MYB-type DNA-binding domain, said transcription factor(s) is/are capable of 
regulating in the source cell the expression of at least one gene coding for a gene 
product involved in biosynthesis of a metabolite or a precursor therefor, 
15 (ii) isolating said nucleotide sequence and 

(iii) inserting said isolated nucleotide sequence(s) into a host cell comprising a gene 
coding for a gene product involved in biosynthesis of a metabolite or a precursor 
therefor, the insertion of said nucleotide sequence(s) resulting in that the expression 
of the gene of the host cell is modulated relative to the expression level in a host 
20 cell not comprising the inserted nucleotide sequence(s). 

In useful embodiments, at least two isolated nucleotide sequences are inserted into 
the host cell. 

A typical non-limiting example of how the method is carried out is described in the 
below examples. 

25 In further aspects, the invention provides an isolated nucleic acid molecule 

comprising a nucleotide sequence coding for an amino acid sequence selected from the 
group consisting of SEQ ID NO:4, SEQ ID NO:5 and SEQ ID NO:6 including such an 
isolated nucleic acid that comprises a nucleotide sequence which is selected from the 
group consisting of SEQ ID NO:l, SEQ ID NO:2 and SEQ ID NO:3 and a mutant, allele 

30 or variant hereof and a polypeptide which includes an amino acid sequence having at least 
40% homology including at least 50%, such as at least 60%, at least 70%, at least SOVo and 



wo 00/46383 



49 



PCT/N LOO/00075 



at least 90% to an amino acid sequence selected from the group consisting of SEQ ID 
NO:4, SEQ ID NO:5 and SEQ ID NO:6. 

There is also provided a polypeptide having the amino acid sequence shown in 
SEQ ID NO:4, SEQ ID NO:5 or SEQ ID NO:6. 
5 The invention also relates to a vector comprising a nucleic acid molecule as defined 

above including such as a vector that comprises at least one further sequence coding for a 
transcription factor and to a host cell comprising such a vector. The host cell is selected 
from the group consisting of a bacterial cell, a flmgal cell, a yeast cell, an animal cell, a 
unicellular eukaryotic cell, an algae cell and a plant cell. 

10 As mentioned above, the invention also provides a method of isolating a nucleotide 

sequence coding for a transcription factor not having a bHLH-type or a M YB-type DNA- 
binding domain, A typical example of a probe that is useful in such a method is a fragment 
of a nucleic acid as shou^i in SEQ ID NO: 1 , SEQ ID NO:2 and SEQ ID NO:3 and a 
mutant, allele or variant hereof. 

15 The invention also provides a method of isolating a nucleotide sequence coding for 

a transcription factor not having a bHLH-type or a MYB-type DNA-binding domain by 
using a nucleotide sequence as shown in SEQ ID NO:l, 2 and 3 in a polymerase chain 
reaction. 

In other aspects, there are provided an isolated nucleic acid having the nucleotide 
20 sequence as shown in SEQ ID NO:7 and a mutant, allele or variant hereof, and a method 
using this nucleic acid sequence for isolating a nucleotide sequence coding for a 
transcription factor not having a bHLH-type or a MYB-type DNA-binding domain. 

The invention will now be further illustrated by the following non-limiting 
examples and the drawings wherein: 
25 - Fig. I shows that the RV region of the S(rl BH promoter is required for elicitor and 
jasmonate responsiveness. Two transformed cell lines for each construct were 
subjected to a control treatment (C), or induced with purified elicitor (PE) or methyl- 
jasmonate (MJ). Constructs were introduced in C. roseus cell lines MP183L and BIX 
via particle bombardment smd Agrobacterium, respectively; 
30 - Fig. 2 illustrates that the RV region of the Strl promoter is an autonomous elicitor and 
jasmonate responsive element. Two transformed cell lines for each construct were 
given a control treatment (C) or induced with purified elicitor (PE), or methyl- 
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jasmonate (MJ). Constructs were introduced in C roseus cell line MP183L via particle 
bombardment; 

- Fig. 3 shows that mutations M2, 3, 4, and 5 in the RV region abolish elicitor and 
jasmonate responsiveness of the Str\ BH promoter. Two transformed cell lines for 

5 each construct were given a control treatment (C) or induced with methyl -jasmonate 

(MJ). Constructs were introduced in C. roseus cell line BIX via Agrobacterium; 

- Fig. 4 shows that mutations M3, 4, and 5 in the RV region abolish elicitor and 
jasmonate responsiveness of the Strl BH promoter and that mutant M2 weakens the 
responsiveness. Two transformed cell lines for each construct were subjected to a 

!0 control treatment (C) or induced with purified elicitor (PE) or methyl-jasmonate (MJ). 

Constructs were introduced in C. roseus cell line MP183L via particle bombardment; 

- Fig. 5 shows that Orca-2 mRNA. but not Orca-1 mRNA, is rapidly induced by MeJA 
and elicitor. C. roseus cells were exposed to MeJA (10 fiM) or partially purified elicitor 
(PE) for the number of hours (h) indicated at the top of the figure. (A) Orcal and Rps9 

15 mRNA levels after addition of MeJA. (B) Orca2, Str and Rps9 mRNA levels after 

addition of MeJA. (C) Orca2, Str and Rps9 mRNA levels after addition of PE. 
Northern blots were hybridized with Orcal, Orca2, Str and Rps9 cDNAs as indicated. 
C, control incubation with DMSO. 

- Fig. 6 illustrates that T-DNA activation tagging of the Orca-3 gene activates expression 
20 of the TIA biosynthetic genes tryptophan decarboxylase (TDC), strictosidine synthase 

(STR) and strictosidine glucosidase (SG). Northern blot analysis of RNA extracted 
from line 46 and a 4-mT-resistant control cell line (C). Blots were probed with Tdc\ 
Strl and Sg (6A). RNAs in figure 6B were hybridised with the rescued flanking plant 
DNA from line 46, containing the complete Orca-3 open reading frame; 

25 - Fig. 7 shows Orca3 expression patterns in elicited C roseus suspension cultures. C 

roseus cells were treated with 50 \xM methyl-jasmonate (MJ), 0.5% crude yeast extract 
elicitor (YE) or with an equivalent amount of partially purified elicitor (PE). Cells were 
harvested after the time points indicated in the figure. RNA was extracted and Northern 
blots were probed with the rescued flanking plant DNA from line 46, containing the 

30 complete Orca-3 open reading frame; 

- Fig. 8 illustrates that ORCA-1 and 2 are transcriptional activators of the iS/r 1 -promoter 
in C. roseus cells. GUS/CAT ratios with standard deviation were calculated from 
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duplicate enzyme activity measurements of three bombarded samples arising from one 
batch of coated particles in a representative bombardment experiment. Tungsten 
particles were coated with a mixture of piasmids containing BH-GUS, empty 
pMOG184, or pMOG184 carrying Orca-l or Orca-l in sense or anti-sense orientation 
5 with respect to the CaMV 35S promoter. To correct for transformation efficiency, a 

plasmid carrying CaMV35S-CAT was added to all coatings; 

- Fig. 9 illustrates that ORCA-1 and 2 transcriptionally activate the S'/r 1 -promoter in C 
roseus cells in a sequence-specific manner, GUS/CAT ratios with standard deviation 
were calculated from duplicate enzyme activity measurements of three bombarded 

10 samples arising from one batch of coated particles in a representative bombardment 

experiment. Tungsten particles were coated with a mixture of GUS piasmids and 
pMOG184 derivatives. The GUS gene was driven by the Str\ BH promoter or BH 
mutant derivatives Ml through M6. pMOGI84 piasmids were either empty (pM), or 
carried Orca-\ (Ol) or Orca-l (02) in sense orientation with respect to the CaMV 35S 

15 promoter. To correct for transformation efficiency, a plasmid carrying CaMV35S-CAT 

was added to all coatings; 

- Fig. 10 shows that ORCA-3 activates expression of the StrWH promoter (-202 to -1 
with respect to ATG start codon). C roseus cells were bombarded with tungsten 
particles coated with a mixture of piasmids containing S(r\ NH-GUS, empty 

20 pMOGl 84, or pMOGl 84 carrying Orca-^ in sense orientation with respect to the 

CaMV 35S promoter, and stained for GUS enzyme activity by addition of the substrate 
X-Gluc; 

- Fig. 1 1 shows that ORCA-3 activates expression of the Tdc -219 promoter (-219 to 
+86 with respect to ATG start codon). C roseus cells were bombarded with tungsten 

25 particles coated with a mixture of piasmids containing Tdc -219-GUS, empty 

pMOG184, or pMOGl 84 carrying ORCA-3 in sense orientation with respect to the 
CaMV 35S promoter, and stained for GUS enzyme activity by addition of the substrate 
X-Gluc; and 

- Fig. 12 shows that ORCA-1, -2 and -3 activate expression of the jasmonate-responsive 
30 RV element from the S/r 1 -promoter. C roseus cells were bombarded with tungsten 

particles coated with a mixture of piasmids containing 4RV-GUS, empty pMOG184, 
or pMOGI 84 carrying OrcaA, -2 or -3 in sense orientation with respect to the CaMV 
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35S promoter, and stained for GUS enzyme activity by addition of the substrate X- 
Gluc. 

- Fig. 13 shows that ORCA3 contains different domains affecting the transcriptional 
activation potential. (A) Schematic representation of ORCA3 and the different deletion 
5 derivatives. Numbers indicate amino acid positions. The AP2-domam is indicated as a 

black box, the acidic domain as a hatched box, and the serine-rich region as a dotted 
box. VP 16 indicates the heterologous Herpes simplex VP 16 activation domain. NLS 
shows the position of a putative bipartite nuclear localization signal. (B) Trans- 
activation potential of the ORG A3 deletion derivatives shown in (A). C.roseus cells 
10 were co-bombarded with the gusA reporter gene fused to the Str promoter, the 

reference reporter gene cat fused to the CaMV 35S promoter, and an overexpression 
plasmid carrying the ORCA3 deletions as indicated fused to the CaMV 35S promoter. 
GUS/CAT ratios are expressed as percentage of empty vector controls. Bars represent 
Mean ± SE (n=8 for empty vector, ORCA3 and A50RCA3; n=6 for A30RCA3 and 
15 A3A50RCA3; n=3 for VP16-A50RCA3). Different letters above the bars indicate 

statistically significant differences (P<0.05). 

- Fig, 14 shows that ORG A3 overexpression increases the expression of primary and 
secondary metabolite biosynthetic genes. Northern blot analysis was performed on 
RNA extracted from (A) tagged cell line 46 (T) and a representative control line 38 (G) 

20 in C.roseus genetic background BIX, (B) 10 03-0X lines and 5 control cell lines in 

genetic background MP183L, and (C) from untransformed MP183L cells exposed for 
6 h to 50 |iM methyl jasmonate (M) or DMSO (G). (D) RNA expression levels were 
quantified and averaged. Mean expression of the control lines is set at 1 . Bars indicate 
Mean ± SE. Asterisks above the bars indicate statistically significant differences, as 

25 judged from the non-parametrical Wilcoxon-Mann- Whitney test (P<0.05). 

- Fig. 15 shows that ORGA3 overexpression increases accumulation of tryptophan, 
tryptamine, and TIAs. Eight days after subculture, tryptophan , tryptamine and TIAs 
were extracted from 5 03-0X and 5 control lines, without loganin supply (left panel), 
or 24 hours after addition of 1 mM loganin to the culture medium (right panel). 

30 ~ Fig. 16 shows that ORGA2 overexpression induces the expression of TIA biosynthetic 
genes. Upper panel: Northern blot analysis was performed on RNA extracted from 14 
02-0X lines and 7 control cell lines. Lower panel: PLNA expression levels of 6 strong 
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Orca2 expressors and 5 control cell lines were quantified and averaged. Bars indicate 
Mean ± SE. Mean expression of the control lines is set at 1 . 
" Fig- 1 7 schematically shows the biosynthesis of TIAs in Croseus, Solid arrows indicate 
single enzymatic conversions, whereas dashed arrows indicate multiple enzymanc 
5 conversions. Genes regulated by ORCA3 or ORCA2 are underlined with solid or 

dotted lines, respectively. 

The abbreviations used to indicate the TIA biosynthetic enzymes are as follows: 

enzymes indicated in the Figure are as follows: Anthranilate synthase (ASA); D-1 
deoxyxylulose 5-phosphate synthase (DXS); Geraniol 1 0-hydroxy lase (GIOH); 
10 NADPHrcytochrome P450 reductase (CPR); Tryptophan decarboxylase fTDC); 

Striciosidine synthase (STR); Strictosidine p-D-glucosidase (SGD); 
Desacetoxyvindoline 4-hydroxylase (D4H); Acetyl-CoAideacetylvindoline 4-0- 
acetyltransferase (DAT). 

15 

EXPERIMENTAL PART 

Terpenoid indole alkaloids (TIAs) form a large group of natural products with 
--3000 known compounds, many of which are pharmacologically active, including well 

20 icnown drugs such as ajmalicine. vinblastine, vincristine, ajmaline. reserpine, 
rescinnamine, camptothecine, ellipticine, quinine, and quinidine. The leaves of 
Madagascar periwinkle or Catharanthus roseus synthesize the very costly anti-tumor drugs 
vinblastine and vincristine, while the roots synthesize ajmalicine, an alkaloid that improves 
cerebral blood circulation. In the past years, extensive studies of secondar>' metabolism in 

25 this plant species and cell cultures thereof have been carried out. The TIA pathway has 
been mapped, although several important steps are not yet elucidated. 

The central intermediate in TIA biosynthesis is the glucoalkaloid strictosidine. This 
compound is formed from the intermediates tryptamine and secologanin. a reaction 
catalyzed by the enzyme suictosidine synthase (STR; Fig. 1 7). Tryptophan decarboxylase 

30 (TDC) catalyzes the formation of tryptamine. Tryptophan is derived from chorismate via 
multiple steps, the first of which is catalyzed by anthranilate synthase (ASA). Secologanin 
is formed via multiple steps from geraniol. Geraniol is first 1 0-hydroxylated by a 
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cytochrome P450 enzyme (geraniol 1 0-hydroxyIase, GIOH), which depends on NADPH- 
cytochrome P450 reductase (CPR) for its activity, Geraniol has recently been shown to 
arise from the non-mevalonate terpenoid pathway (MEP pathway), of which the first step 
is catalyzed by D-1 deoxyxylulose 5-phosphate synthase (DXS). 
5 After strictosidine, the first step is hydrolysis of the glucoside (strictosidine P-D- 

glucosidase, SOD), resulting in a reactive dialdehyde. The pathway then diverges to the 
various types of alkaloids. No enzymes have yet been found that catalyze the formation of 
the different alkaloid skeletons. Several enzymes fiirther down the pathway leading to the 
dimeric alkaloids (vinblastine and vincristine) have been characterized (D4H and DAT; 
10 Fig. 1 7). From the TIA and precursor pathways in C roseus, the genes encoding DXS 
(accession number AJOI 1840), ASA (accession number AJ250008), GIOH (accession 
number AJ251269), CPR (accession number X69791), TDC (accession number X67662), 
STR (accession number X6I932), and SGD (accession number AFl 12888), have been 
cloned. D4h (accession number U71604) and Dat (accession number AF053307) genes 
15 acting further downstream in the pathway have also been cloned. 

For Tdc and Str the genomic sequences including elicitor-responsive promoter 
regions have been cloned. All cloned structural genes active in the TIA biosynthetic 
pathway are coordinately induced by jasmonic acid (Figure 14C). Based on the coordinate 
regulation, it was hypothesized that common regulatory proteins control the expression. 
20 Recently, two jasmonate-responsive transcription factors (called ORCA2 and ORG A3; for 
Octadecanoid-Responsive Catharanthus AP2) that regulate several structural genes were 
cloned. Both proteins belong to the AP2 -domain class of transcription factors, which is 
exclusively found in plants. The AP2-domain is responsible for DNA binding. The 
expression of the Orca2 and 3 genes is rapidly induced by jasmonate with faster kinetics 
25 than the structural genes Tdc and Str. Both ORCA2 and 3 proteins bind to the Tdc and Str 
promoters, and can activate the promoters in C.roseus cells in a transient assay. 
Elimination of their binding site from the Str promoter abolished jasmonate 
responsiveness. 

The (preliminary) data, illustrated by the examples below, indicate that ORCA2 
30 and 3 are master regulators of TIA biosynthetic gene expression, and control different 
subsets of TIA structural genes. In addition, ORG A3 regulates genes acting early in TIA 
metabolism, as well as a gene acting later in the pathway. Furthermore, ORCA3 also 
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regulates genes in primary metabolism leading to TIA precursors. The data strongly 
suggest that together, ORCA2 and 3 may regulate additional and perhaps all genes in the 
pathway. 

The biosynthesis of many classes of secondary metabolites is stimulated by 
5 jasmonates and several of its precursors. For a few pathways, for which enzymes and 

genes are known, this biosynthetic stimulation was shown to be preceded by induction of 
genes and corresponding enzyme activities that are active in these biosynthetic pathways. 
A non-exhaustive list of metabolite classes induced by jasmonates includes: taxoids, 
pheynlpropanoids, flavonoids, anthocyanins, guaianolides, anthraquinones, 
10 sesquiterpenoids, various types of alkaloids (benzophenanthridine alkaloids, terpenoid 
indole alkaloids). 

Genes induced by jasmonates include those encoding tyrosine/DOPA 
decarboxylase, berberine bridge enzyme, allene oxide synthase, 4-coumarate-coA ligase, 
phenylalanine-ammonia lyase, chalcone synthase, bergaptol methyl-transferase, 
1 5 lypoxygenase. 

In addition, many genes in primary metabolite biosynthetic pathways were 
shown to be induced by jasmonates, and include those encoding anthranilate 
synthase, isochorismate synthase, deoxyxylulose -phosphate synthase, 
cytochrome P450 reductase, hydroxy-methylglutaryl -coenzyme A reductase. 
20 In all these cases, ORCA-like AP2-domain proteins may be strongly suspected 
to mediate these jasmonate responses, and manipulation of the expression 
levels of such ORCA-like proteins may modulate the levels of these various 
metabolites. 

The techniques applied in the below examples are all standard recombinant DNA 
25 and molecular cloning techniques well known in the art and are e.g. described by 

Sambrook et al. (1989) Molecular cloning: A laboratory manual, second edition. Cold 
Spring Harbor Laboratory Press and Ausubel et al. (1987) Current protocols in molecular 
biology, Greene Publishing Associates and Wiley Interscience, New York. 



30 
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EXAMPLE 1 

1.1 Identification of an elicitor- and jasmonate-responsive element in the Str\ promoter 

The RV region of the promoter of the sirl gene (DDBJ/EMBL/GenBank database 
accession number Y10182) of Catharanthus roseus (Madagascar periwinkle) coding for 
5 strictosidine synthase was, as described below, via promoter emalysis including detailed 
scanning mutagenesis, identified as an elicitor- and jasmonate-responsive element. 

The Bgill/HincII fragment (BH) of the Str\ promoter and a BH fragment in which 
the Rsal/Avall fragment (RV) was deleted (BH'*^^) were introduced as Bglll/Sall 
fragments into the GusSH vector consisting of pBluescript SK+ carrying the p- 
10 glucuronidase reporter gene. These Str\ promoter derivatives fused to the gusA gene were 
introduced into two different C roseus cell lines. Cell line MP183L was transformed via 
particle bombardment, whereas cell line BIX was transformed via an improved 
Agrobacterium strain (see example 4.1). For Agrobacterium-m^dmlQd transformation, the 
GusSH derivatives were introduced into the binary vector pMOG221ambdaCAT, a 

!5 derivative of pMOGlambdaCAT containing the hygromycin resistance gene instead of the 
kanamycin resistance gene as a plant selectable marker. Northern blot analysis was 
performed for two independently transformed cell lines for each construct, each estimated 
to consist of a population of at least 100 to 1000 independent transformants. The BH 
fragment of the Strl promoter conferred low basal level expression onto the gusA gene in 

20 the control (0.05% dimethylsulfoxide (DMSO)) treated cells (Fig. 1 ). Incubation of 

MP183L cells transformed with BHGusSH with partially purified yeast extract elicitor 
(PE; Menke et al.. Plant Physiology, April 1999, Vol. 11 9, pp. 1289- 1296) and methyl- 
jasmonate (MJ) for 6 h induced the expression of the gusA gene several fold (Fig. 1 upper 
panel). These conditions similarly induced the endogenous Strl gene as described 

25 previously (Menke et al.,1999). The BH**^^ fragment did not confer elicitor- and MJ- 

responsive gene expression onto the gusA reporter gene in cell line MP183L (Fig. I upper 
panel). Similar results were obtained with BIX cell lines transformed with these Str\- 
promoter derivatives fused to gusA. BIX cells carrying the ^H-gusA fusion had low basal 
gusA mRNA levels, but incubation for 6 h with MJ induced high gusA gene expression 

30 (Fig.l lower panel). BIX cells carrying BH-RV fused to gusA did not have detectable basal 
expression levels and could not be induced by MJ (Fig.l lower panel). This loss of 
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function experiment shows that the RV region of the Strl promoter is required for elicitor- 
and jasmonate-responsive gene expression. 

In a gain of function experiment it was tested whether the RV fragment of the Sir I 
promoter was able to confer elicitor-and JA-responsive gene expression onto a minimal 
5 promotGT-gusA fusion. To this end tetramers of the RV region were constructed and fused 
to a Caulifower Mosaic Virus 35S promoter truncated to -47 and fused to gusA in the 
vector GusSH-47. The RV region was cloned as a Rsal/Avall fragment into the 
Smal/EcoRV sites of pIC20H, and tetramerized using the enzymes BamHI/Bglll. The 
tetramer was cloned as a BamHI/Bglll fragment into the BamHI site of GusSH-47. C. 

10 roseus cell line MPl 83L was transformed with this construct via particle bombardment. 
Control cell lines were generated by introducing GusSH-47 or 6Tcyt-GusSH-47 in two 
independently transformed cell lines for each construct. The 6Tcyt construct contains 6 
head-to-tail copies of the cyt-1 cis-acting element from the T-DNA T-CYT gene which is 
not elicitor- or jasmonate-responsive. Northern blot analysis was performed for two 

15 independently transformed cell lines, each estimated to consist of a population of at least 
100 to 1000 independent transformants. Incubation of the 4RV-47 cell lines with PE or MJ 
for 6 h resulted in induction of gusA gene expression as compared to the expression of the 
control (0.05% DMSO) incubated cells (Fig.2). The same treatment of cell lines carrying 
the -47 construct did not result in measurable gusA mRNA levels in the control or induced 

20 samples (Fig.2). In the cell lines carrying the 6Tcyt-GusSH-47 construct, high basal levels 
of gusA mRNA were found (Fig.2). Treatment of these cell lines with PE or MJ for 6 h did 
not induce expression (Fig.2). These results show that the RV region is sufficient to confer 
elicitor- and jasmonate-responsive gene expression onto an otherwise inactive minimal 
promoter-gusA construct. The results with the 6Tcyt lines show that the RV region does 

25 not potentiate a silent elicitor-responsive element within the CaMV -47 region or 
elsewhere on the construct. 

Detailed block scanning mutagenesis of the RV region in the context of the BH 
fragment of the Strl promoter was performed to pinpoint the exact location of the elicitor- 
and jasmonate-responsive-sequence within the RV fragment. Mutations were introduced 

30 by PGR with primers having a 6 bp mismatch combined with M13-reverse primer using 
the XhoI/HincII fragment of the Strl promoter cloned in pIC20H as a template. The 
primers RVM1-RVM8 are disclosed in the below Table 1.1 (SEQ ID NOs 8-15) 
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Table 1.1 Primers used in detailed block scanning mutagenesis of the RV region 



KVM 1 


r'C A cnTnn'TTCiT A C^TCTCT^ Ad AC C-"^' 

J — \^ / \ V VJ 1 VJ VJ 1 1 VJ 1 /W_J I 1 V — 1 1 / v VJ -f^ V-^ V — Jf 


KVMz 


nnTACATCACjACAATCrACCaCCTTC-V 


In. V LVLJ 


5'-CACTCTTACTGGCGCTTCTTTGAAAG-3' 


RVM4 


5'-AGACCGCGAAGAATGAAAGTG-3' 


RVM5 


5'-AGACCGCCTTCTTACTTTCTGATTTCCCC- 
3' 


RVM6 


5'-TTGAAAGACTAAACCCTTGGAC-3' 


RVM7 


5'-GAAAGTGATTTGGGAACGACCTTG-3' 


RVM8 


5'-CCCTTGCTGGAAGTTTGGTGAG-3' 



PCR fragments obtained were used as primers in PGR reactions together with MI3- 
forward primer on the XhoI/HincII fragment of the Strl promoter cloned in pIC20H. PCR 
5 fragments obtained were reamplified using M13-reverse and M13-forward primers. 

Amplified PCR products were digested with Bglll/Sall and cloned into GusSH digested 
with BamHI/Sall. In total 8 mutated versions of the RV fragment were made, covering the 
entire fragment from the Rsal site to the Avail site. In each mutant 6 adjacent nucleotides 
(nt) were changed into their complementary nt, i.e. changing A to T and G to C. The 8 

10 mutated versions of the BH fragment fused to gusA were introduced into C. roseus cell 

lines MPl 83L and BIX as described above. Northern blot analysis was performed for two 
independently transformed cell lines for each mutant construct, that were incubated for 6 h 
with DMSO or MJ for C roseus BIX cell lines (Fig.3) and with DMSO, PE or MJ for C 
roseus MP183L cell lines (Fig.4). By comparing basal and MJ-induced gusA gene 

15 expression for all the mutant BIX lines to the expression conferred by wildtype BH 
fragment it is clear that mutations 1, 6, 7 and 8 conferred a wildtype gene expression 
pattern (Fig.3). In mutant 2 the expression was only minimally inducible. In mutants 3, 4 
and 5 expression was uninducible, and similar to the expression conferred by the BH-RV 
fragment (Fig.3). The experiment with the MP183L lines carrying the mutated BH 

20 fragment gave similar results (Fig.4), Mutants 1,2 and 6 showed expression patterns 

similar to the wildtype BH fragment. Mutants 3, 4 and 5 are not consistently inducible by 
PE or MJ and gave similar expression patterns to the BH-RV fragnient. 
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In another experimental series, the effect of the block mutations was studied in 
constructs consisting of tetramers of RV derivatives fused to GusSH-47. Tetramers of RV 
mutants M2, M3, M4, M5 and M6 were constructed as described above for RV wildtype 
fragment, and fused to a Cauiifower Mosaic Virus 35S promoter truncated to -47 fiised to 
gusA in the vector GusSH-47. The constructs were introduced in C roseus cell Ime 
MP183L via particle bombardment. Two independent transformed cell lines were 
generated for each construct. Incubation of the cell lines carrying RV wildtype or mutant 
M6 tetramer constructs with PE or MJ for 6 hours resulted in induction of gusA gene 
expression as compared to the expression of the control (0.05% DMSO) incubated cells 
(results not shown). The same treatment of cell lines carrying the mutant M2, M3, M4 or 
M5 tetramer constructs did not result in measurable gusA mRNA levels in the control or 
induced samples (results not shown). 

These results show that the elicitor- and MJ-responsive element is contained within 
the 24 bp sequence that is covered by mutations 2 to 5, with mutations 3, 4 and 5 having 
the most dramatic negative effect on the inducibility of the BH or the RV fragment of the 
Str\ promoter. 

EXAMPLE 2 

rinning of PRC A- 1 and 2 

The RV region was, as described in Example 1, identified as an elicitor- and 
jasmonate-responsive element in the Str\ promoter. Use of the RV element as bait in a 
one-hybrid transcription factor screening resulted in the isolation of 17 cDNA clones that 
corresponded to two different mRNA species, encoding proteins belonging to the 
APETALA2 domain class of transcription factors. They were named ORCA-1 and 2 (for 
Octadecanoid-Responsive Catharanthus AP2). 

The RV region of the Str\ promoter (DDBJ/EMBL/GenBank database accession 
number Yl 01 82) was cloned as a Rsal/ Avail fragment into the Smal/EcoRV sites of 
pIC20H and tetramerized using the enzymes BamHI/Bglll. The tetramer was cloned as a 
BamHI/Bglll fragment into the BamHI site of pHIS3NX. The teU-amer-HIS3 gene fusion 
was transferred as a Notl/Xbal fragment to the yeast integration plasmid pINTl. 

The resulting plasmid was linearized with Ncol and introduced into yeast strain 
Y187. Recombinants were selected on YPD medium containing 150 ^g/ml G418, and the 
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occurrence of a single recombination event between the pINTl derivative and the 
chromosomal PDC6 locus was verified via Southem blotting. Yeast strain Y187-4RV was 
then used in a one-hybrid screen for DNA-binding proteins with a cDNA library of 
Catharanthus roseus cloned in the plasmid pACTII. To construct the library, cDNA was 
5 synthesised with a Stratagene kit on 2 different poly(A) RNA preparations mixed in a 1 : 1 
ratio isolated from cell suspensions that were elicited with 0.05 % yeast extract for 1 and 4 
h, respectively. The cDNA was cloned in the EcoRI/XhoI sites of lambdaACTII. The 
amplified lambda librsiry, consisting of 3.5 x 10*^ independent primary transformants was 
converted to a pACTII plasmid library via Cre-lox-mediated in vivo excision in Cre- 
10 expressing in E, coli strain BNN132. Four million Y187-4RV transformants were screened 
in a one-hybrid screening, Plasmids isolated from 364 colonies obtained on SD medium 
without histidine and leucine were retransformed to Y187-4RV, and two Y187 derivatives 
carrying tetramers of RV mutant fragments RVM4 and RVM5 fused to the HIS3 gene. 
Seventeen plasmids showed strongly reduced growth in Y187-4RVM4 compared to Y187- 
15 4RV and Yl 87-4RVM5, whereas all other plasmids did not give rise to significant 

differences in growth between the three yeast strains. Partial sequencing of these plasmids 
revealed that they belonged to one of only two classes. The ORCA-1 class consisted of 3 
plasmids carrying cDNA sequences with identity to the sequences in SEQ ID NO 1 . SEQ 
ID No:l is derived from plasmid RVI24. Both sequences are identical throughout the main 
20 open reading frame, but diverge at their 3' ends. Obviously, they are derived from the same 
gene, and the differences could have arisen by alternative splicing of the primary 
transcript. They contain the complete coding region, since SEQ ID No. 1 contains an in 
frame stop codon preceding the first start codon. The ORCA-2 class consists of 14 
plasmids carrying cDNA sequences with identity to the sequence in SEQ ID No:2. All 
25 ORCA-2 cDNA sequences appeared to be partial. The missing portion was isolated via 
PGR with the primers 4TH (5'-CCCCACCAAACCCAAAAAAAG-3'; SEQ ID NO: 16) 
and RV117 (5'-CCATATCCTCGATCCTTTTCTC-3*; SEQ ID NO: 17) using the pACTII 
cDNA library as a template. A prominent 0.6 kbp PCR band was digested with EcoRI and 
BamHI, and cloned in pBluescript II SK+. A complete clone was constructed in 
30 pBluescript II SK-H by fusion of the PCR fragment with the cDNA fragment fi-om plasmid 
RV2 1 0 using the unique BamHI site. 



wo 00/46383 



61 



PCT/NLOO/00075 



EXAMPLE 3 

Rx pression of ORCA-1 and -2 

To analyse the expression of the Orca-1 and Orca-2 genes suspension-cultured C 
roseus cells were incubated with MJ (10 \iM) or PE (0.05% YE equivalent). After 
5 incubation periods ranging from I to 24 h the cells were harvested and immediately frozen 
in liquid nitrogen. Total RNA was isolated and Northern blots were probed with the full- 
length sequence from SEQ ID NO:l, specific for Orcal, and a 450 bp EcoRI/Kpnl 
fragment from SEQ ID NO:2, specific for Orca-2. Orcal gene expression was not 
modulated by MJ (Fig.5A), or PE (data not shown). MJ-induced gene expression was clear 
0 for Orca-2 with high levels of expression starting at 0.5 hours and remaining high to up to 
8 hours after MJ addition as compared to the control (Fig.SB). Orca-2 was also rapidly 
induced by PE, with an increase of mRNA levels within 1 h and a high level to up to 8 
hours after PE addition (Fig.5C). These results show that Orca-2 is inducible by MJ and 
PE, whereas Orcal expression is not affected. 

EXAMPLE 4 
Cloning of ORCA-3 

T-DNA activation tagging has been used as a reverse genetic method to isolate 
genes involved in plant growth and development. Genes involved in responses to ABA and 
cytokinin have been identified using this technique. A T-DNA carrying a strong 
transcriptional enhancer is integrated into the plant genome by Agrobactehum 
lume/aciens-mcdiaicd transformation. Expression of genes adjacent to the integration site 
will become deregulated, which can result in a dominant mutation. 

To isolate dominant mutants with increased expression of TIA (terpenoid indole 
alkaloid) biosynthetic genes, the Tdc gene was used as selectable marker. TDC is involved 
in the conversion of L-tryptophan into tryptamine, one of the first steps in TIA 
biosynthesis. TDC is able to use substrates other than L-tryptophan, like 4- 
methyltryptophan (4-mT). This compound is toxic to plant cells, and is converted by TDC 
in to the non-toxic 4-methyItryptamine. We used 4-mT to select for C roseus cell lines in 
which the activation tag causes an increase in Tdc expression. The selection for ceil lines 
with increased Tdc expression is described below. 
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4.1 Plasmids and A^robacterium tumefaciens strain 

Plant cell transformations were done by applying the ternary vector system (van der 
Fits et al., manuscript in preparation). A Kpnl fragment carrying a constitutive virG mutant 
gene was cloned onto plasmid pBBRlMCS and this plasmid was used as ternary vector. 

5 A brief outline of the cloning of the binary vector Tag-2B4 A 1 follows: The 35S 

promoter derivative 2B4A1, containing multiple copies of the B and Al enhancers from 
the CaMV 35S promoter, was cloned on a pUC21 plasmid containing the Hpt gene driven 
by a deletion derivative of the FMV 34S promoter ( 3AS-Hpt) and the gusA reporter gene. 
This complete plasmid (including pUC21 sequences) was digested at a unique Bglll site 

0 and cloned into the binary vector pSDM14 cut with BamHL The 35S promoter is now 
reading towards the right border sequence. 

Both plasmids (binary and ternary vectors) were introduced into Agrobacterium 
tumefaciens strain LBA4404 via triparental mating using pRK2013 as a helper strain. 

5 4.2 Transformation of Catharanthus roseus suspension cultured cells 

Agrobacterium tumefaciens strains were grov^ for 3 days at 28°C on solid AB 
medium containing 20 |-ig/ml rifampicin, 100 fig/ml kanamycin and 75 \i%/m\ 
chloramphenicol. From these bacteria a liquid overnight culture was grown in the same 
medium. Cells were harvested by centrifligation and resuspended at a OD600 of 1 in LS 
medium of pH 5.2 supplemented with 2 mg/1 NAA, 0.2 mg/1 kinetin, 3% sucrose, 1% 
glucose and 100 pM acetosyringone (hereafter called cocultivation medium). 

C roseus cells (cell line BIX) are grown in LS medium supplemented with 3% 
sucrose, 0.2 mg/1 kinetin and 2 mg/1 NAA (hereafter called LSI 3) at 25°C (16/8 h 
light/dark cycle) and subcultured every 7 days by transferring 7.5 ml of cells in 50 ml 
medium. For transformation 7 days old cells were used. The culture medium was replaced 
by cocultivation medium and the cell density was adjusted to 50% sedimentation volume. 
On a petridish (diam. 80 mm) containing 25 ml of cocultivation medium solidified with 
0.7% plant tissue culture agar, 7.5 ml of this C roseus suspension was mixed with 750 nl 
Agrobacterium tumefaciens culture. Dishes were taped using leucopor tape and put at 25°C 
in the dark for 3 days. After the cocultivation the cells were washed and plated on a paper 
filter disc (diam. 75 mm, Whatmann, No.4). These filters were placed in petridishes 
containing 25 ml LSI 3 with 50 pg/ml hygromycin-B, 400 pg/ml cefotaxim, 100 ^xg/ml 
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vancomycin and 0.4 mM 4-methyl-tryptophan. Filters were transferred onto fresh medium 
weekly. 

4 1 Isolation of a 4-mT resistant C. roseus line with increased TIA b iosvnthetic gene 
5 expression 

An estimated number of 400,000 to 500.000 stable, independent C roseus 
transformants were generated by Agrobacterium tumefaciens-mediated transformation 
with the tagging construct Tag-2B4AL Transformants were placed on selective medium 
containing 50 ^ig/ml hygromycin-B, 400 |ig/ml cefotaxim, 100 |ag/ml vancomycin and 0.4 
0 mM 4-mT. After several weeks of selection 281 independent 4-mT-resistant calli were 
obtained. These calli were transferred into liquid medium containing 50 ^ig/ml 
hygromycin-B and 0.2 mM 4-mT. From these suspension cultures RNA was extracted. 
Northern blot analysis showed that several mutants had an increased Tdc mRNA level 
(data not shown), which in all likelihood is causing the 4-mT resistance. One these mutants 
(line 46) is described in more detail below. 

For line 46 Northern blot analysis was performed using other genes from the TIA 
biosynthetic pathway. RNA extraction and Northern blot analysis were performed. Ten ^g 
RNA samples were loaded onto the gels. As 32P-labeled probes, cDNAs encoding TDC, 
STR and SGD were used. 

Besides Tdc, also Str\ and Sg mRNA levels were increased significantly in the 
tagged cell line 46, when compared to a 4-mT resistant control cell line (number 38) 
(Fig.6A). Thus in line 46, the T-DNA tag appears to have activated a central regulator of 
TIA biosynthetic gene expression, thereby inducing the expression of at least three genes, 
and possibly the complete TIA biosynthetic pathway. Chromosomal DNA was extracted 
from 50 gram fresh weight of cells from suspension line 46 and isolated by CsCl- 
Ethidiumbromide ultracentrifigation. Southern blots were made and the gusA gene was 
used as 32P-labeled probe isolated from GusSH, Southern blot analysis of line 46 showed 
that only one T-DNA copy was integrated into the plant genome (data not shown). 

For plasmid rescue, 10 jag genomic DNA of line 46 was digested with Xbal (a 
unique site in tagging construct Tag-2B4A1), ligated and transformed to electrocompetent 
cells of E.coli strain NM554. 
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Plasmid analysis of the obtained E.coli transformants identified the rescued plasmid 
46X8, which carried 1.6 kb of plant DNA immediately flanking the right border of the 
tagging construct. Using the rescued sequence as a probe on a Northern blot of RN A 
isolated from line 46 and control line 38, one hybridising mRNA species was observed 
5 (Fig.6B). This transcript was absent in control cell line 38, whereas it was present at a high 
level in line 46. 

Sequence analysis of plasmid 46X8 revealed an open reading frame (ORF), located 
at approximately 600 bp downstream of the start of the rescued plant sequence. Homology 
searches in databases revealed one AP2 domain present in the predicted amino acid 

10 sequence of this ORF, 

AP2 domain containing proteins are a class of transcription factors exclusively 
found in plants. The encoded protein was called ORCA-3 (for octadecanoid-responsive 
Catharanthus AP2 domain). Several AP2 domain transcription factors, all containing one 
conserved AP2 domain, have been shown to participate in stress signalling (Pti, EREBP, 

15 CBFl, DREB). CBFl has been demonstrated to act as central regulator of cold-induced 
gene expression in Arabidopsis thaliana. In analogy, the tagged gene described here, 
appears to encode a central regulator of TIA biosynthetic gene expression. 

The sequence rescued on plasmid 46X8 is of genomic origin, and may be 
interrupted by intron sequences. To isolate a cDNA copy of the mRNA transcribed from 

20 the tagged gene, a PGR approach was followed. Primer sequences were based on DNA 
sequence information from the rescued plasmid. Primer ORl (5'- 

GAATTCATATGGCGGAAAGCTGTCAGGAGGATTC: SEQ ID NO: 18) was combined 
with the vector T3 primer and OR4 (5'-CGACGTCGTAGAAGGCTCCGCAGGG; SEQ 
ID NO: 19) with the vector Ml 3-40 primer in PGR reactions using lambda DNA prepared 

25 from a C roseus root cDNA library as a template to isolate the 3' respectively 5* parts of 
the ORCA3 cDNA. Based on DNA sequence data from these partial cDNA clones, 
primers OR5 (5' AGATGTCATAGTTCCGAAGAAATC-ATTTGCGTGTGAG; SEQ ID 
NO:20) and OR6 (5* AGACTCGTGAACTTTTTTGGATATAAAATTTTGTAG-ATTCC; 
SEQ ID NO:21) were developed, which were used in a PGR on the rescued 46X8 plasmid 

30 to obtain the complete ORCA-3 open reading frame. The cDNA and amino acid sequences 
are provided in SEQ ID NO:3 and SEQ ID NO:6, respectively. Homology with other AP2 
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domain proteins, including the Catharanthus proteins ORCA-1 and ORCA-2 described 
above, is restricted to the AP2 domain. Sequences outside this domain are not conserved. 

EXAMPLE 5 

5 The tagged ORCA-3 gene is induced by elicitor and methyl -ia smonate 

Previous research showed that the octadecanoid pathway, leading to the synthesis 
of jasmonate, is involved in elicitor-induced expression of TIA biosynthetic genes (Menke 
et al., 1999). Treatment of C. roseus cells with methyl-jasmonate (MJ) thus resulted in a 
transient increase of both Tdc and Str\ mRNA levels. 

10 The T-DNA tagging experiment in Example 4.3 showed that the Tdc, Str\ and Sg 

genes are regulated by ORCA-3. Based on the fact that one other Catharanthus AP2 
domain protein, ORCA2, is involved in the elicitor- and jasmonate-responsive expression 
of the Str\ gene (Example 2), it was speculated that ORCA-3 may be a second elicitor- and 
jasmonate-responsive transcription factor. 

15 Northern blots were loaded with RNA from C roseus cells treated for various time 

points with MJ, crude yeast extract elicitor (YE) and with a partly purified elicitor from 
yeast extract (PE). Subsequently, blots were probed with plasmid 46X8, containing the 
tagged gene. Treatment of cells with MJ resulted in a rapid increase in expression of the 
tagged gene (Fig.7). This increase was already observed within 2 h after addition of MJ 

20 and sustained for at least 28 h. The induction of Tdc and Str\ expression was slower 

(maximal at 4 h after treatment) and was also maintained for at least 28 h (Menke et al,, 
1999). Because of the induction by a product from the octadecanoid pathway, the tagged 
gene was named Orca-3 (Octadecanoid Responsive Catharanthus AP2-domain protein). 
Incubation of cells with crude (YE) and partly purified (PE) elicitor (Fig.7) resulted in a 

25 rapid and transient increase of Orca-S mRNA levels, although the expression level is low 
relative to MJ-induced levels. The maximal induction of gene expression is at 
approximately 2 h after addition of the elicitor. The maximal induction o^Tdc and Strl 
expression by the elicitors occurs later (maximal induction after 6-8 h; Menke et al., 1999). 
The Northern blot analysis shows that Orca-3 expression is elicitor- and jasmonate- 

30 responsive and precedes the induction of the TIA biosynthetic genes, indicating that 
ORCA-3 is an elicitor- and jasmonate-responsive transcription faptor that coordinately 
controls TIA biosynthetic gene expression. 
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10 



EXAMPLE 6 

Binding site of PRC A- 1 and 2 within the RV region of Str promoter 

To identify the ORCA binding site within the RV region, Yl 87 yeast derivatives 

that contained tetramers of the RV wildtype fragment and mutants Ml through M6 (see 

below and Table 1.1 for mutations) fused to the HIS3 gene were constructed via single 

cross-over in the PDC6 locus as described above. 

The sequence of the RV region (SEQ ID NO:7) of the Str I promoter is shown 

below. Mutated blocks are underlined or indicated in bold face. Nucleotides within each 

block were changed into their complementary nucleotides. 

M2 M4 M6 

GT ACATCA CTCTTA GACCGC CTTCTT TGAAAG TGATTT CCCTTG GACC 
Ml M3 M5 M7 



15 The yeast strains were then separately transformed with pACTII derivatives RV124 

(encoding ORCA-1 fused to the GAL4 activation domain) and RV210 (encoding part of 
ORCA-2 including the AP2 domain fused to the GAL4 activation domain). Transformants 
were plated on SD minimal medium without leucine but containing histidine. Colonies 
obtained after 5 days were streaked on SD plates without histidine and leucine, and growth 

20 was recorded after 5 days (Table 6. 1 ). Growth indicates that ORCA proteins bound the RV 
derivative and activated HIS3 gene transcription, whereas lack of growth indicates that 
ORCA proteins failed to bind the RV derivative. The results indicate that ORCA-1 and 2 
failed to confer growth in yeast containing mutants M3 and M4 fused to the HIS3 gene. In 
these mutants, blocks of 6 nucleotides are changed in their complementary nucleotides 

25 within the sequence GACCGCCTTCTT. This sequence is similar to the binding site 

identified for tobacco EREBP proteins (Ohme-Takagi and Shinshi, Plant Cell 7:173-182, 
1995) and tomato Pti proteins (Zhou et al., EMBO J. 16, 3207-3218,1997) within PR gene 
promoters and for Arabidopsis CBF and DREB proteins within COR gene promoters 
(Stockinger et al., Proc.Acad.Sci.USA, 94:1035-1040,1997). 

30 In addition, ORCA-2 failed to confer growth in yeast containing mutant M2 fused 

to the HIS3 gene. 
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TABLE 6.1 

Growth of veast Yl 87 derivatives containing different RV mutants ftised to the HIS3 gene 
transformed with QRCA-1 or ORCA-2 expressing pACTII plasmids. 



RV derivative 


ORCA-1 (RV124) 


ORCA-2 (RV210) 


RV 




+ 


M2 


+ 




M3 






M4 






M5 


+ 


+ 


M6 


+ 


+ 



EXAMPLE 7 

ORCA-1 and 2 are transcriptional activators of the .S/rl -promoter in C. roseus cells 

To investigate whether ORG A- 1 and 2 can act as transcriptional activators 
interacting with the S'/rl -promoter, C. roseus cells were transiently cotransfomied with a 
S'/rl-promoter-gw5^ fusion (BH-GusSH) construct , an overexpression vector (pMOGI84) 
carrying Orcal or Orcal cDNAs fused in sense or antisense orientation to the CaMV 35S 
promoter and a overexpression vector carrying the chloramphenical acetyl transferase 
gene {cat). As a control, cotransformation of BH-GusSH with an empty overexpression 
vector was done. The CaMV 35S-cat construct served as an internal control for 
transformation efficiency. Cells were transformed through particle bombardment as 
described before, using the three constructs in a ratio of 1:1 :3 (gus:cat:orca). Twenty four 
hours after transformation, cells were harvested and frozen in liquid nitrogen. GUS and 
CAT activity assays were performed and the Gus reporter gene expression was expressed 
as a GUS/CAT ratio to correct for transformation efficiency. Expression of Orca-l in 
sense orientation resulted in a higher GUS activity (gus/cat ratio, 1.6 fold induction) as 
compared to the control (BHGusSH +pMOG 184) (Fig. 8). Expression of Orca-1 in 
antisense orientation did not alter the GUS activity level (Fig. 8). Expression of Orca-2 in 
sense orientation resulted in a 3.3 fold induction of GUS activity, wKereas expression of 
Orca'2 in antisense orientation had little or no effect on GUS activity as compared to the 
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control transformed cells (BHGusSH +pMOG 184) (Fig. 8).These results show that 
transient overexpression of either ORCA-1 or 2 (in sense orientation) enhances the 
expression of the gusA reporter gene driven by the Str-promoter fragment BH. 

To show that ORCA-1 and 2 enhance the expression driven by the 1 -promoter in 
a sequence-specific manner, co-transformations were done using wild-type and mutant 
versions of the ^/r 1 -promoter (ftised to gusA). ORCA-1 enhanced the expression of the 
gus reporter gene to some degree through interaction with wild-type and mutant versions 1, 
2, 5 and 6 of the BH fragment of the 5//- 1 -promoter, as compared to the control pMOG184) 
(Fig. 9). ORCA-1 did not enhance the expression of reporter constructs driven by mutant 
versions 3 and 4 of the S/r 1 -promoter fragment BH (Fig. 9). ORCA-2 strongly enhanced 
the expression of gus reporter gene through interaction with wild type and mutant version 
1, and 6 of the BH fragment of the 5/r 1 -promoter (Fig. 9), as compared to the control 
(pMOG184). ORCA-2 enhanced gus gene expression weakly for mutants 2 and 5 (Fig. 9). 
ORCA-2 did not enhance the expression of reporter constructs driven by mutant versions 3 
and 4 of the 5/r-promoter fragment BH (Fig. 9). These results show that the activation of 
gene expression by ORCA-1 and 2 is sequence-specific and occurs through direct 
interaction with the 5/r 1 -promoter fragment RV. The sequence requirement for ORCA 
activation in plant cells is slightly different from that in yeast cells (see example 6). In 
yeast, mutations M2, M3 and M4 affect the ability of ORCA-2 to activate gene expression, 
whereas in plant cells mutations M5 additionally affects gene expression to some degree. 
These differences may reflect differences in binding in the two cell types, for instance due 
to interaction with different endogenous proteins. On the other hand, the transient co- 
bombardment data closely correlate with the results of the stable transformation (Example 
1), indicating that indeed lack of ORCA binding is the reason for lack of expression of 
mutants M2, 3, 4 and 5 in stably transformed lines. 

EXAMPLE 8 

ORCA-3 is a transcriptional activato r of the Tdc and Str\ p ro moters in C. roseus cells 
To investigate whether ORCA-3 activates expression from the Tdc- and Str\- 
promoters, a particle bombardment assay was performed. C. ro^ew^y cells were co- 
bombarded with the gitsA-rsponcT gene with or without an overexpression vector 
containing the Orca-3 cDNA in the sense orientation. The ^wjy^ -reporter gene was driven 
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by a fragment of the Tdc-promoXcr (-219 to +86 as referred to the translational start site) or 
the -promoter (-202 to -1 as referred to the translational start site). 

24 h after bombardment the cells were stained for GUS activity. A clear increase in 
the number and intensity of blue spots was observed when the g-wj^^ -reporter gene was co- 
bombarded with 35S-Orca3, when compared to a negative control in which the cells were 
co-bombarded with the empty overexpression vector. This enhancement was observed 
both with the TJc-promoter fragment (Fig. 1 1) as well as with the 5/rl -promoter fragment 
(Fig. 10) driving reporter gene expression. As a control, GUS activity from an unrelated 
promoter (6Tcyt) was not increased after co-bombardment with Orca-3 (data not shown). 
Thus in this transient transformation assay, Orca-3 activates gene expression from the Tdc- 
and iS/rl -promoter in a sequence-specific manner. 

EXAMPLE 9 

ORCA-K -2 and -3 activate expression of the iasmonate-responsive RV element from the 
^/rl -promoter 

The transient co-bombardment assay, as described above, was used to investigate 
whether the ORCA proteins activate gene expression of the jasmonate-responsive RV 
element from the iS/rl -promoter. C. roseus cells were bombarded with a ^w5^ -reporter 
gene driven by a tetramer of the RV element (4RV) in combination with the empty 
overexpression plasmid, 35S-Orcal , 35S-Orca2 or 35S-Orca3, The number and intensity 
of blue spots was increased clearly upon co-bombardment with one of the three Orca 
genes, when compared to control cells, which were co-bombarded with the empty vector 
(Fig. 12). The strongest activation of gusA expression was observed when Orca-2 or Orca- 
3 were used. However, to make statements about the relative strengths of activation of the 
3 different ORCAs, quantitative analysis is necessary, thereby correcting for differences in 
particle coating and bombardment efficiencies. In addition, the differences in activation 
may be due to differences in expression levels of the three ORCA proteins, instead of 
differences in activation strength. 
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EXAMPLE 10 

ORCA-3 does not activate gene expression of RV mutants M2 and M3. and weakly for 
mutants M4 and M5 

35S-Orca3 was co-bombarded with the 4RV mutant series (described in Example 
5 1) to determine which sequences in the RV element were importamt for the activation by 
OrcaJ, After co-bombardment cells were stained for GUS activity and blue staining was 
quantified visually by estimating the number and intensity of blue spots. The results are 
shown in the below Table 10.1. 



TABLE 10.1 

Relative GUS activities as determined visually 



Cells bombarded 
with 


Relative GUS staining 


4RV + pMOG 


2 


2 


2 


4RV + 35S-Orca3 


4 


4 


4 


4M2 + pMOG 


1 


1 


1 


4M2 + 35S-Orca3 


1.5 


1 


1 


4M3 + pMOG 


0.5 


0.5 


0.5 


4M3 + 35S-Orca3 


0.5 


0.5 


1 


4M4 + pMOG 


0.5 


0.5 


0.5 


4M4 + 35S-Orca3 


1 


1 


1 


4M5 + pMOG 


0.5 


0.5 


0.5 


4M5 + 35S-Orca3 


1.5 


1.5 


2 


4M6 + pMOG 


3 


2.5 


2.5 


4M6 + 35S-Orca3 


4 


4 


4 



The above Table 10.1 shows the relative GUS activities as determined visually. 
The C roseus cells were co-bombarded in triplicate with the gusA reporter gene driven by 
4RV or a mutant of this element together with an overexpression plasmid carrying the 
Orca3 cDNA OSS-OrcaS) or the empty vector (pMOG). Cells were stained for GUS 
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activity. The numbers in Table 10.1 are an estimation of the intensity and number of blue 
spots. Values range from 0 (no blue spots) until 4 (many intense blue spots). 

No activation of GUS activity was observed when the gusA reporter gene was 
driven by a tetramer of RV mutant M2 or M3. For mutants M4 and M5, the basal GUS 
5 activity was lower, when compared to the wild-type. However, ORCA-3 was able to 

activate gene expression of these mutants. Expression of mutant M6 resembled the wild- 
type, both in basal GUS activity levels, as well in inducibility by ORCA-3. Thus ORCA-3 
is able to activate gene expression from the RV element and sequences that are disrupted 
in mutants 2 and 3 are important for this sequence specific activation. Again, quantitative 
0 analysis is necessary to make conclusive statements about the exact sequence requirements 
for activation of gene expression. The preliminary results indicate that the sequence 
requirements for activation of gene expression in plant cells by ORCA-3 are the same as 
for ORCA 2. 

EXAMPLE 1 1 

ORCA3 protein contains an acidic region that has an activating function, and a serine-rich 
region that has a negative regulatory function in transcription 

ORCA3 contains an acidic region preceding the AP2-domain. Acidic domains 
often function in transcriptional activation. In addition, ORCA3 contains a serine-rich 
region C-terminal of the AP2-domain (Fig. 13 A). The in vivo function of these regions was 
assayed by testing the activation potential of different ORCA3 derivatives on Str promoter 
activity (Fig.l3B). The i'/r 1 -promoter (-202 to -1 respective to the translation start site) 
ftised to the gusA gene was cobombarded with different ORCA3 derivatives (shown in 
Figure 13 A), cloned in sense orientation in the plant expression vectors pRTlOl or 
pRT104 (containing the CaMV 35S promoter). Full-length ORCA3 protein activated the 
Strl promoter about 5-fold (Figure 13B). Deletion of the acidic region of ORCA3 
(resulting in construct A50RCA3) abolished most of the activation. This reduction was not 
caused by decreased promoter binding in vivo nor to impaired nuclear localization of 
A50RCA3 protein, since activation was restored by fusion of a heterologous acidic 
activation domain from Herpes simplex viral protein VP 16 (amino acids 413-487) to 
A50RCA3. Deletion of the serine-rich region (construct A30RCA3) resulted in higher 
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activation. This indicates that the acidic region has an activating effect on transcription, 
whereas the serine-rich region has a repressing effect, 

EXAMPLE 12 

Overexpression of ORCA3 up-regulates multiple genes in primary as well as secondary 
metabolism 

In line 46, the OrcaS gene was activated by insertion of a T-DNA activation tag. 
Gene expression in the tagged cell line 46 overexpressing OrcaS was analysed by Northern 
blotting and hybridization with different cDNA clones. This showed that the TIA 
biosynthetic genes Tdc, S(r, Sgd^ and D4h were expressed at high levels, while the 
expression of GlOh and Dat was not affected (Fig.I4A). In addition, genes encoding the 
alpha subunit of AS {Asa) and DXS, enzymes involved in primary metabolism producing 
precursors of the indole and terpenoid moiety of TIAs, respectively, were induced by 
OrcaS overexpression, whereas two other primary metabolic genes, that are not involved 
in production of TIA precursors (geranylgeranyl pyrophosphate synthase, Ggpps; and 
isochorismate synthase, Ics) were not regulated by ORCA3 (Fig. 14A). 

The rescued OrcaS gene fused to the 2B4A 1 plant promoter was introduced in 
C.roseus cells by particle bombardment to generate stably transformed cell lines that 
overexpressed the OrcaS gene (03-0X cell lines). The 03-0X lines had higher expression 
levels of the endogenous Tdc, Str, Cpr and D4h genes (Fig.l4B and D). Due to the variable 
expression of Sgd in the different lines, no statistically significant effect of OrcaS 
overexpression was observed in the 03-0X lines (Fig.MB and D). The GlOh gene was not 
activated by ORCA3 overexpression (Fig.3B and D). In addition to the genes mentioned, 
also anthranilate synthase (Asa) and D-l-deoxyxylulose-5-phosphate synthase (Dxs) were 
expressed at higher levels in the 03-0X lines (Fig. 14B and D). This indicates that ORCA3 
not only regulates multiple genes in the TIA pathway, but in addition genes in primary 
metabolism that are involved in production of TIA precursors. Although it has been shown 
that genes fD4h and Dat) in the later part of the pathway leading to vindoline and dimeric 
alkaloids have a different cell-type-specific regulation compared to genes (Tdc and Str) in 
the earlier part of the TIA pathway, the D4h gene also shows increased expression in 03- 
OX lines compared to control lines (Fig.MB and D). 
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EXAMPLE 13 

Overex pression of PRC A3 results in elevated tryptophan, trvpt amine and TIA production 

Metabolite levels were measured in five representative 03-0X lines and in 5 
control lines. The 03-OX lines contained increased levels of tryptophan and tryptamine 

5 (Fig. 15, left panel). The 03-0X cell lines did not accumulate alkaloids in standard LS-13 
growth medium (Fig. 15, left panel), probably because the GlOh gene was not activated by 
ORCA3 overexpression. Upon feeding of a secologanin precursor (loganin), increased 
amounts of strictosidine (8,1 1 ± 0.87 ^moies/g DW), ajmalicine (0.25 ± 0.10 ^moles/g 
DW; collectively termed TIAs in Fig. 15, right panel) and an unidentified lochnericin-like 

10 TIA (1.21 ± 0.66 area units/g DW) accumulated compared to control lines. 

EXAMPLE 14 

Overex pression of ORCA2 up-regulates Tdc. Sir J and GlOh gene express ion 
15 The Orca2 gene, ftised to the CaMV 35S promoter in vector pMOG184, was 

introduced in C.roseus cells by particle bombardment to generate stable transgenic lines 
that overexpress the Orca2 gene (02-0X lines). RNA analysis showed that the genes Tdc 
and Str showed increased expression (Fig. 16), although the effect on Tdc was marginal. 
Surprisingly, the GlOh gene, which was not expressed in 03-0X Unes (Fig.MB and D), 
20 was highly expressed in 02-0X lines (Fig. 1 6). 



